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Chapter 
Introduction 



In this introductory chapter, we begin with a brief description of spin glasses in section 
1. We are not physicists. The purpose of this section is to trace the history of the 
models. Section 2 gives a brief summary of the thesis and section 3 recalls certain 
known facts which will be used later in the thesis. 



0.1 Origin of the problem 

The models considered in this thesis have their origin in spin glass theory. Roughly, 
spin glass is a glassy state in a spin system or a disordered material exhibiting high 
magnetic frustration. The origin of this behavior can be either a disordered structure 
(such as that of a conventional, chemical glass) or a disordered magnetic doping in 
an otherwise regular structure. But what is a glass? Loosely speaking, it is a state of 
spins with local ordering (in solid state physics, this is called local 'freezing' - locally, 
the system looks more like an ordered solid rather than a disordered liquid) but no 
global ordering. Spin glass can not remain in a single lowest energy state (the ground 
state). Rather it has many ground states which are never explored on experimental 
time scales. The freezing of the spins, in spin glasses, is not a deterministic one like 
ferromagnetic materials. Rather they freeze in random with some memory effect. 

Experiments show that the susceptibility obtained by cooling the spin glass system 
in the presence of a magnetic field yielded a higher value than that obtained by first 
cooling in zero field and then applying the magnetic field. If the spin glass is cooled 
below Tc (a certain critical temperature) in the absence of an external field, and then 
a magnetic field is applied, there is a rapid increase towards a value, called the zero- 
field-cooled magnetization. This value is less than the field-cooled magnetization. 
The following phenomenon has also been observed in the measurement of remanent 
magnetization (the permanent magnetization that remains after the external field is 
removed). We can cool in the presence of external field, remove the external field 
and then measure the remanent magnetization. Alternatively, first cool with out the 
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external field, then apply the external field and measure the remanent magnetization 
after removing the external field. The first value is larger than the second one. 

The other peculiarity of the spin glasses is its time dependence, which will be 
explained now, that makes it different from other magnetic systems. Above the spin 
glass transition temperature, Tc, the spin glass exhibits typical magnetic behavior. 
In other words, at temperature above Tc, if an external magnetic field is applied and 
the magnetization is plotted versus temperature, it follows the typical Curie law (in 
which magnetization is inversely proportional to temperature). This happens until Tc 
is reached, at which point the magnetization becomes virtually constant. This is the 
onset of the spin glass phase. When the external field is removed, the spin glass has 
a rapid decrease of magnetization to a value called the remnant magnetization, and 
then a slow decay as the magnetization approaches zero (or some small fraction of the 
original value). This decay is non-exponential and no single function can fit the curve 
of magnetization versus time adequately below T^. This slow decay is particular to 
spin glasses. If a similar procedure was followed for a ferromagnetic substance, when 
the external field is removed, there would be a rapid change to a remnant value, but 
this value is a constant in time. For a paramagnetic material, when the external field 
is removed, the magnetization rapidly goes to zero. In each case, the change is very 
rapid and if carefully examined it is exponential decay. 

Behind this strange behaviour of spin glasses, according to physicists, there are 
essentially two major causes. These are quenched disorder and frustration. The term 
"quenched disorder" refers to constrained disorder in the interactions between the 
spins and/or their locations but does not evolve with time. In statistical physics, a 
system is said to present quenched disorder when some parameters defining its be- 
haviour are random variables which do not evolve with time, i.e., they are quenched 
or frozen. This is in contrast to annealed disorder, where the random variables are 
allowed to evolve themselves. Usually the spin orientations depend on several facts 
such as the interactions, external fields and thermal fiuctuations. Their dynamics 
or thermodynamics will suggest whether to order or not. The spin glass phase is 
an example of spontaneous cooperative freezing (or order) of the spin orientations in 
the presence of the constrained disorder of the interactions or spin locations. It is 
thus "order in the presence of disorder". On the other hand, "frustration" refers to 
confiicts between interactions and the spin-ordering forces, and not all can be obeyed 
simultaneously. Frustration arises when pairs of spins get different ordering instruc- 
tions through the various paths which link them, either directly or via intermediate 
spins. The relevance of frustration is that it leads to degeneracy or multiplicity of 
compromises forcing the system to have several ground states. 

Keeping these two in mind, in 1975, S. F. Edwards and P. W. Anderson [21] pro- 
duced a paper, which in the words of Sherrington j37j, at one fell swoop recognized 
the importance of the combination of frustration and quenched disorder as funda- 
mental ingredients, introduced a more convenient model, a new and novel method 
of analysis, new types of order parameters, a new mean field theory, new approxi- 
mation techniques and the prediction of a new type of phase transition apparently 
explaining the observed susceptibility cusp. This paper was a watershed. Edwards 
and Anderson's new approach was beautifully minimal, fascinating and attractive but 
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also their analysis was highly novel and sophisticated, involving radically new con- 
cepts and methods but also unusual and unproven ansatze, as well as several different 
approaches. In their model, two spins interact if they are neighbour to each other. 
The same year Sherrington and Kirkpatrick [38j proposed their model with mean field 
interaction. In this model all spins interact with each other. In both the cases the 
interaction among the spins were random and driven by Gaussian random variables. 
Due to rich and complicated correlation structure among the energy over the configu- 
ration space of the spins, initially the models were not easy to study analytically. To 
get some insight into these models, in 1980, B. Derrida [15] proposed a system without 
any correlation structure over the configuration space. He proposed a solvable model 
called Random Energy Model (REM) for spin glass theory. In REM, all the random 
variables are independent and identically distributed but the distribution depends on 
the number of particles. Like Edwards- Anderson model and SK-model, he considered 
these random variables to be Gaussian. But this is a toy model since the energy of the 
system does not depend on the configuration. Amazingly he could show that though 
this is a very simple model, it exhibits phase transition. 

REM has no correlations at all. But the correlation structure in the Edwards- 
Anderson model and SK-model were very complicated. So the next idea is to study a 
system which exhibits correlations, but their structure is simple enough to explicitly 
solve the model. B. Derrida [17J proposed another model for spin glass theory in 1985, 
by bringing correlations through a tree structure. The tree structure comes from the 
configuration space. Simply put, he identifies the configuration space as the branches 
of a tree. This is called Generalized Random Energy Model (GREM), a generalization 
of the REM. Here also the driving distributions were Gaussian. In this project we 
will focus ourselves on REM and GREM and some related models. 



0.2 Setup and Summary 

For an N particle system with classical spins +1 or —1, a sequence of +1 and -1 
of length N gives a configuration of the system. A typical configuration is denoted 
by (t{N) or by a when N is understood. That is, a is a sequence of +1 and -1 of 
length N. The space of all possible configurations a of a system is called configuration 
space and denoted by S^v or simply by 2^ since S^v is nothing but {-|-1, — 1}^. Now 
depending on the configuration, the system possesses some energy called Hamiltonian. 
For a configuration a, it is denoted by H]^{a). The model is defined through the 
Hamiltonian. So different models have different Hamiltonian structures. In spin glass 
theory, the Hamiltonian is considered to be random. 

When the system is cooled, it settles down at a configuration where the Hamil- 
tonian is minimized. Hence it is very essential to get information about the con- 
figurations where the infimum of the the Hamiltonian is attained and its value. In 
statistical physics one analyzes this problem via the partition function of the system. 
The partition function, denoted by Z^{(3), is defined as follows: 
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Here (3 > is a parameter, represents the inverse temperature. Sometimes when the 
Hamiltonian depends on an external field h, we will denote the partition function as 
Zisi{(3, h). Now note that among all the summands in the above sum if one takes large 
(3, only that summand will contribute where the Hamiltonian attains the minimum 
among all possible configurations. On the other hand, if the focus is on maximum, 
then instead of —jS one has to consider j3 in the exponent. 

But the information in partition function about the minimum energy is in expo- 
nential scale. So it is customary to study the logarithm of the partition function. 
Also the energy of the system depends on the number of particles in the system and 
becomes large when is large. To get some asymptotic result on log Z]\f{i3), one 
has to normalize it properly. In this the correct normalization (in some 

sense). According to statistical physics, —J^logZ^^P) is called the free energy of 
the system. Since one is interested in the asymptotic of the free energy, that is, in 
— 4 lim j^log Zn{P), for mathematical purpose we can forget about the —4 term in 

the definition of free energy. And from now on, we will call lim ^log Z]\f{j3) as the 

N^oo 

free energy of the system. 

In statistical physics, there is another important concept called Gibbs' distribution. 
This is a distribution on the configuration space. According to this, the probability 
of a configuration a is proportional to e^^^^'^'^\ In particular, if GN{(y) denotes the 
Gibbs' probability for a configuration o" G S^v, then 

g-/3//jv((7) 

It is worth noting that, since HN{a)^s are random, the Gibbs' distribution is also 
random. Note that, Gibbs' distribution is so defined as to give maximum weight 
to that configuration which has minimum energy. We shall not deal with Gibbs' 
distributions in this thesis. 

Generalized random energy model (GREM) is one model in this theory proposed 
by B. Derrida [17] in 1985. To describe a version of this fix an integer n > 1. For A^ 
particle system, consider a partition of A^ into integers k{i, N) > for 1 < i < n so 
that ^{h ^) = ^ ■ The configuration space 2^, naturally splits into the product, 

i 

■Q 2'i:(«.^) and a G 2^ can be written as 0102- ■■<^n with Oi G 2^^^*'^^ An obvious 
n-level tree structure can be brought in the configuration space. Consider an n level 
tree with 2*^^^'^^ many edges at the first level. These edges are denoted by ai, with 
0"! G 2'^'^^'^^ In general, below a typical edge aia2 ■ ■ ■ (Ji-i of the (i — l)-th level there 
are 2'^'^*'^^ edges at the i-th. level denoted by cri(j2 ■ ■ ■ o"j_icrj for o".j G 2^*^*'^^ Thus 
a typical branch of the tree reads like cxi (J2 ■■■ cr„ making a one one correspondence 
with 2^, the configuration space. For each i, 1 < i < n and edge (Xi ■ ■ ■ CTj, associate 
a random variables ^(cti ■ ■ -(Tj). All these random variables are i.i.d. A/'(0, A^). One 



5 



Setup and Summary 



non random weight, > for each level is fixed. In GREM, Hamiltonian for a 
configuration cr = o"i • • ■ cr„ is defined as 



When n = 1, GREM reduces to REM, another model proposed as a solvable model 
by B. Derrida [15j in 1980. If ai = 1 then Hamiltonians of REM are nothing but 2^ 
many i.i.d. A/'(0, N) random variables. 

Though it was just a toy model, with correct but heuristic arguments Derrida [T6] 
showed phase transition occurs in REM and in the low temperature the system got 
completely frozen. In 1986, B. Derrida and E. Gardner [18] gave the solution for the 
averaged free energy for GREM and in 1987, Capocaccia et al p] gave a rigorous 
mathematical justification. Indeed, the convergence holds almost surely as well as 
in Lp for 1 < p < oo. In 1989, Galves et al |2l] studied the detailed fiuctuation 
of free energy for both the models and further analysis was carried out in 2002 for 
REM and other models by Bovier et al [7j. In a different direction, Dorlas and 
Wedagedera [20], in 2001 used the large deviation principle (LDP) [TUHl] to study 
the free energy for REM. In the next year, Dorlas and Dukes [19] extended this 
technique to GREM. Though GREM is a little complicated than REM, it is not a 
realistic model for spin glasses. More realistic models were proposed earlier in 1975 by 
Edwards and Anderson [21] (EA-model) through nearest neighbour interaction and 
another by Sherrington and Kirkpatrick [38] (SK-model) by mean field correction in 
the same year. These are the most complicated models in this theory. Though several 
heuristic arguments and conjectures [33] were made and several rigorous results were 
proved [Il|23l[25l[39lll0lllT], it was only in 2002, Guerra and Toninelli [26] showed 
the almost sure existence of the free energy via interpolation technique and convexity 
argument. A discussion of the SK-model using stochastic calculus was initiated by 
Comets and Neveu [12] continued in [TTll^]. For EA-model very little has been known 
till know. In 2003, the idea of Guerra and Toninelli has been generalized to the GREM 
cases by Contucci et al [13J. We thank these authors for clarifying their setup. 

Note that all this analysis was done with Gaussian driving distributions. In 2004, 
Carmona and Hu [9] considered non-Gaussian distributions and showed that the free 
energy of the SK-model does not depend on the driving distribution. Rather, under 
some moment condition on the driving distributions the free energy of SK-model is 
universal (see also [lOJ). It should be noted that earlier already in 1983, Eisele [22] 
considered a class of distributions with exponentially decaying tails for the REM. He 
is the first to identify the relevance of LDP to study free energy for REM. He studied 
completely different types of phase transitions - some kind of iterated large deviation 
phenomena. For the analysis to go through, he assumed the existence of exponential 
moments of all orders for the driving distributions. The last two articles are the 
starting point for this thesis. Now the natural question to ask is, whether there is any 
universality of free energy in REM as well as in GREM? Moreover, is the existence of 
exponential moments of all orders necessary? 

To answer the above questions our first successful attempt [2BJ via LDP argu- 
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(0.2.1) 
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ment was with double exponential driving distributions. In [28] , we provided negative 

answer to the above questions. First of all, considering H^^ to be i.i.d. double expo- 
nential driving distribution with parameter 1, we show that the nontrivial free energy 
is different from that of the Gaussian REM. Though the Hamiltonian does not depend 
on A^, it is interesting, the system exhibits phase transition. Secondly, note that in 
this case Ee^^^ does not exists for t > 1. Here in the first chapter, we extract the 
essence of the argument in [28] and state as 

Theorem 0.2.1. Let {Xn} satisfies LDP with a strictly quasi-convex rate function 
1{x). For a.e. uj, the sequence of empirical measure {^n{uj)} of 2^ i.i.d. random 
variables having law Xn satisfies LDP with rate function J given by, 



We apply this theorem to the known Gaussian case [T6l[20l[M] , as well as to double 
exponential case and further to WeibuU type exponentially decaying tail distributions. 
We also show that the energy in REM is not distribution specific rather rate specific. 
In the compact distribution section we give some partial results when there is no 
non-trivial rate function for the driving distributions. In the concluding section, we 
apply the above theorem to discrete distributions - Poisson and Binomial. There we 
show that even the existence of phase transition depends on the parameter of the 
underlying distributions. For example, if the Hamiltonian H^ify) is Binomial with 
parameter N and p, phase transition takes place only when p> \. 

For GREM, once again our first attempt [29] was with the double exponential 
driving distributions along with the LDP arguments [19]. The original formulation 
of GREM in the literature is slightly different from the formulation we mentioned 
above. In the second chapter, we start with a discussion of this reformulation. Then 
we bring a general tree structure in GREM and prove a basic fact which is used in 
the analysis of this chapter as well as for several other models considered in the next 
chapter. The details are in chapter 2. Briefly, we consider trees all of whose branches 
extend up to n-th level. Let Bi^ be the total number of edges at the i-th level and 
Bn be the number of leaves of the tree. Let s'j^ be the sum of the squares of the 
numbers of leaves at the n-th level below each edges at the ith level. If ^ denotes a 
random variable having the common distribution of the ^(o"i(T2 ■ ■ ■ cXi), then we have 
the following. 



Theorem 0.2.2. Lei A = Ai x ■ ■ ■ x A„ C M'". Denote qtN = P(C e A^) for 

1 < i < n. 




7 



Setup and Summary 



//" Tl, BiNqiN ■ ■ ■ QiN < oo, for some i,l < i < n then a.s. eventually, 

N>n 



/xjv(A) = 0. 



2 

b) If for alii = 1, ■ ■ ■ ,n, ^ < oo, then for any e > a.s. eventually, 

N>n * 



(1 - e)E/i^(A) < fiN{A) < (1 + e)E/i^(A) 



In section 2.4, we use this result for GREM with a general family of driving 
distributions. For fixed 7 > 0, we consider the driving distributions of ^(o"i ■ ■ ■ <7i) 
having density 



1 /7\^ ^ 



^N,j{x) = — y- - 00 < a; < 00. (0.2.2) 



'1' 



Note that when 7 = 2, this is the Gaussian case. We discuss this case systematically 
in section 2.5 and bring out the essence of this model. Here it is. 

For each j, 1 < j < n, we have a sequence of probabilities {Ajy,A^ > 1} on M 
satisfying LDP with a good, strictly quasi convex rate function Ij and ^((Xi • ■ - cTj) ~ 
A]y. Define for each u, fi^i^) to be the empirical measure on M", namely. 



a 



where S (x) denotes the point mass at x G M". 

Theorem 0.2.3. Suppose ^ p^. > g for 1 < j < n. Then for a.e. to, the 

sequence {/iAr(co'), > 1} satisfies LDP with rate function J given as follows: 

j j 

Supp{J) = {(xi, ■■■ ,Xn) ■■ Yl ^k{xk) < E Pfclog2 for 1 < j < n} 

k=l k=l 

and 



J{x) 



Ik{xk) if X e Supp{J) 

k=l 

00 otherwise. 



This result, with the help of Varadhan's Integral lemma [^[T^ . reduces the prob- 
lem of free energy to merely calculation of certain infimum. In section 2.6, we solve 
this variational problem for general n and produce the explicit energy expression in 
the case of 7 > 1 and 7 = 1 by different arguments. Further, for 7 > 1, we characterize 
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the energy function for GREM and show that the energy function is continuous in 7. 
For < 7 < 1, we only give the energy expression for n = 2. The beauty of the above 
theorem (Theorem I0.2.3I) is that, it allows us to consider different distributions at 
different levels of the under lying tree. This we considered in j30j and here in section 
2.7. Even the simple case, n = 2 the model exhibits a lot of interesting phenomena. 
For example, consider a 2 level GREM with exponential driving distribution at the 
first level and Gaussian in the second, and give equal weights at the two levels, that 
is, tti = a2- Then even if p2 = 0.00001 (very small) the system reduces to a Gaussian 
REM. On the other hand, if we consider a 2 level GREM with Gaussian driving dis- 
tribution at the first level and exponential in the second, the system will never reduce 
to a Gaussian REM. Moreover, in either case, the system will never reduce to that of 
an exponential REM. 

In the third chapter, we randomize the underlying trees. To keep the same num- 
ber of furcations for all edges at given level, for fixed A^, we take one Poisson random 
variable at each level to determine the number of furcations. We called this model 
as regular Poisson GREM. On the other hand, it is possible to keep the number of 
furcations depend on the edge. In other words, for each edge we can associate a Pois- 
son random variable to determine the number of furcations for this edge. This we 
called Poisson GREM. We discussed multinomial variation also using results from |27j . 
These are all different methods to randomize the tree. Note that the configuration 
space is no longer 2^ . These models are interesting and Theorem 10.2 . 21 above is power- 
ful enough to handle these models. However, in all these cases the free energy remains 
same as in the usual GREM. Whether there are other interesting tree structures that 
exhibit peculiar phenomena is not clear to us. As far as our knowledge goes, the 
GREM with randomized (or even nonrandomized but general) trees is not discussed 
in the literature. 

In 2006 Bolthausen and Kistler [3] proposed a model (BK-GREM) bypassing the 
ultrametricity in the configuration space. Even in this model, they have shown that 
the energy of the system is again a suitable GREM energy. In section 3.4, we provide 
a proof via LDP arguments. Then in section 3.5, we construct n\ many GREMs, 
one corresponding to each permutation of the set {1, 2, ■ ■ ■ , n} by manipulating the 
weights from BK-GREM. We characterize a class of permutations so that (1) the 
corresponding GREM energy will be the same for all the permutations in that class, 
(2) this energy is the minimum over all possible n\ many GREMs and (3) this is the 
energy of the BK-GREM. Bolthausen and Kistler [5], have shown that the energy of 
BK-GREM is the infimum over GREM energies corresponding to all possible chains. 
Our analysis shows that instead of considering all chains, one needs to consider n! 
many GREMs. This is still a large number. We conclude this chapter, by defining 
one model, called block tree GREM, where the free energy is maximum of all possible 
n! many n level GREMs, rather than minimum as in the BK-GREM. 

In the last chapter, we introduce a new model, word GREM. This brings out the 
crucial role played by LDP in all the earlier models. Here we start with a distribution 
having finite mean and Xn denotes the law of the sample mean (of size A^). Then 
Cramer's theorem ^14j suggests that the sequence {Xn} satisfies LDP with a convex 
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rate function given in terms of the Fenchel-Legendre transformation of the starting 
distribution. We consider a set of n symbols / = {^i, ^"2, ■ ■ ■ , and take 5* to be any 
collection of finite number of words formed by these n symbols. As earlier, we consider 
N) >0, l<z<nasa partition of so that the configuration space 2^ splits 
as nr=i 2''^^'^\ For s = ^j^^^jj ' ' ^ ^ a configuration a = (cr\ ■ ■ ■ , a"') where 
a' e 2'=(*'^\ we denote a{s) = {a'\a'\--- ,a''). For s e S, let A' be a probability 
on M having finite mean. Let denote the distribution of the mean of the first 
N random variables of an i.i.d. sequence with common law A'^. For the N particle 
system, we have the following. For each s E S and each a G 2^, we have a random 
variable ^{s,a{s)). These are independent random variables. For fixed s, they are 
identically distributed and the common distribution is A|^. Then for a configuration 
a = (cTi, ■ ■ ■ , CTjv) G 2^, we define the Hamiltonian in word GREM as 

N 

Hn{ct) = Nfi^a)) + hY,^,. (0.2.3) 

i=l 

where / : ^ M is a continuous function, ^(cr) = (^(s, cr(s)))sg5 and /i > is the 
intensity of the external field. 

We present a large deviation proof for the existence of the free energy for this 
model and apply the analysis to known |T6] REM with external field. 

This model includes REM, GREM and BK-GREM and may perhaps include mod- 
els truly more general than these. Further, it allows external field. Moreover, different 
driving distributions can be used at different words in the collection. 

In this project, we did not consider the analysis of Gibbs' distribution. With Gaus- 
sian driving distribution, there are several results for REM [^H] and for GREM [5,6J. 
See also [35l[3T| [32 | [36] . For exponential driving distribution we verified that for REM, 
in the high temperature regime, Gibbs's distribution converges to the uniform distribu- 
tion pS] where as in the low temperature regime it converges to the Poisson-Dirichlet 
distribution. This is similar to that of Gaussian REM. So is it true for any other 
distributions considered in this thesis? Since we do not have anything substantial to 
say regarding this issue, we have not considered. 



0.3 Large Deviation Terminology 

Recall that 

where E^r is expectation w.r.t uniform probability on 2^ space. And hence, 

4 log Zm (/?) = log 2 + 4 log E.e-^'^^ . 
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The last term in the above equation is well known expression in the Laplace's 
principle. It is indeed 



where jj^ is the uniform probability on the space 2 . The only trouble is are 
on different spaces. If we transport at to M by the map a i— > ^^-^j^, then we will 
arrive at exactly the Laplace type situation, where Varadhan's integral lemma comes 
to rescue. Since ifjv depends on cu, the transported probability will be random. So 
the application of LDP needs careful attention. 

Since there are several terminologies (using e or using etc) for large deviations, 
we fix our terminology now and recall some known facts. Let X be a Polish space. 

Definition 0.3.1. A function X : X — ^ M is called a lower semicontinuous function if 

for any a e M, the set {x : I{x) < a} is a closed set. It will be called good if the set 

{x : 1{x) < a} is a compact set. 

The following two properties of lower semicontinuous function are worth mention- 
ing. 

Proposition 0.3.1. Let f he a lower semicontinuous function. Then for any x, 



Proposition 0.3.2. Let f be a good function and be a sequence of closed sets 

so that Fn+i C Fn for every n and C^nF^ — {xq\. Then 



Definition 0.3.2. Let {^jln} be a sequence of probabilities on X. {//at} is said to 
satisfy large deviation principle with rate function X if 

1. T : X — > [0, oo] is a lower semicontinuous function, 

2. for any Borel set B, 




sup 

G: neighbourhood of x 



inf /(l/) = f{x). 



f{xo) = lim inf f{y). 



inf T(x) < liminf — log//jv(S) < limsup — log//jv(S) < — mf_T(x). 

x&BO N N N N xeB 
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Further, if for < a < oo, the set {x : T{x) < a} is a compact set, then X is called a 
good rate function. 

A sufficient condition for the existence of LDP is the following: 

Proposition 0.3.3. Le^ X he a Polish space. Let A he an open haseforX. Let {hn} be 

a sequence of probabilities on X. For each A E A, let L^{A) = — liminfjv jflogiiN{A) 
and L*{A) — — limsupjv log iin{A). Suppose for every x eX, 

sup L^{A) — sup L*{A)—I{x) (say). 
xeAeA xeAeA 

Assume moreover that either {/Ujv} is eventually supported on a compact set or the 
sequence is exponentially tight, that is, given any a < oo, there is a compact set K 
such that limsup^ j^\og^N{K'^) < —ol. 

Then the sequence {/xat} satisfies LDP with rate function I. 

The next proposition is a variation of well-known Varadhan's integral lemma, 
which will suggest that only we have to calculate some infimum to get the free energy 
limit. 

Proposition 0.3.4. Suppose the sequence of prohahilities {/in} on a Polish space X 
satisfies LDP with rate function I and, moreover, /in o-f^ eventually supported on a 
compact set C. Then for any continuous function / : X — > 

hm i-log / e-^^d/.^ = - mi{f{x)+I{x)}. 

AT— »oo iV J xEC 

We need the following known as Cramer's Theorem. 

Theorem 0.3.5. Let Xi, X2, ■ ■ ■ be i.i.d. real valued random variables with EXi < 00 
and Ajv be the law of their sample mean. Then the sequence of probabilities {Xn} 
satisfies LDP with a convex rate function I given by 

I{x) = sup{9x - log^e^^i}, (0.3.1) 

V 

where V ^ {9 : logEe'^^^ < 00} C M. 
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The following is also true. 

Lemma 0.3.6. Let I be as in the above theorem. If T>x — {x : T{x) < oo} and 
X — EXi < oo, then 

1. X{x) = 0, 

2. X is strictly decreasing on {x <x} H T>x, 

3. X is strictly increasing on {x < x} H Vj. 



Chapter 1 

The Random Energy Model 



In this chapter we discuss a toy model of spin glass theory, called Random Energy 
Model (REM). In the literature [16lll2], this is driven with Gaussian distributions. 
In [22], Eisele discussed the model with more general distributions, particularly with 
regularly varying distributions and the relevance of large deviation methods in this 
context. We study the model with other types of distributions. For instance, the 
driving distributions could be exponential or more generally Weibull. Or they could 
be compactly supported etc. Our discussion mainly rehes on the idea of Dorlas and 
Wedagedera. In [20!, they first used the large deviation techniques to get the asymp- 
totics of the free energy. 

After defining the model in the first section, we give a general large deviation 
result in the section 1.2 and apply the results in REM with diverse distributions in 
section 1.3. In section 1.4, we give partial results with compact distributions where 
we could not use the large deviation results. We conclude this chapter by considering 
the model driven by some discrete distributions. 



1.1 Setup 

In this model, proposed originally by B. Derrida [15], for each N, the Hamiltonian 
H^i^a) are independent over a G Sjv. Derrida considered them to be centered Gaus- 
sian with variance A^. In spite of the simplicity of this model, in [TB] , he showed the 
existence of phase transition. Using the entropy energy equation, he evaluated the 
limiting annealed free energy limjsf j^E log Z^^P) and showed that for low tempera- 
ture, that is, for (3 large, free energy becomes linear in f3. It is known that, in fact, 
j^logZ^iP) converges a.s. [2D]- 
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1.2 Main Results 



Let us consider a sequence of probabilities {\n,N > 1) on M". Assume that {Xn} 
satisfies large deviation principle (LDP) with a strictly quasi-convex good rate function 
I{x). An extended real valued function /, which may take the value +00 but not —00, 
defined on a convex set will be called a strictly quasi-convex function if for any two 
distinct xi and X2 in {f{x) < 00} and for each 6 G (0, 1) we have f{Oxi + (1 — 0)x2) < 
ma.x{f{xi),f{x2)}. For every A^, let ^i, 1 < i < 2^ be i.i.d. random variables (M" 
valued) with distribution Aat. These random variables, of course, depend on A^ but to 
ease the notation we are suppressing their dependence on N. For every sample point cu, 
we define iin{'^) to be the empirical measure on R", namely /iAr(<^) = 2^ ^ ^ 
Here 5 (x) denote the point mass at x. Now we are ready to state our first theorem. 

Theorem 1.2.1. For a.e. lu the sequence {ij,n{u!)} is supported on a compact set and 
satisfies LDP with rate function J given by, 



Proof. Step 1: Let A be an open subset ofW. //X)2^A7v(A) < 00, then almost 
surely eventually /iAr(A) = 0. 

Indeed, using P for the probability on the space where the random variables are 



J{x) 



I{x) «/J(x)<log2 



00 



^/T(x) >log2. 



defined. 



P(//jv(A) > 0) = P{^i e A for some i) < 2^Ajv(A). 



Now Borel - CanteUi completes the proof. 



Step 2: Let A be an open subset o/R". // ^ 



< 00, then for any e > 0, 



almost surely eventually 



(1 - e)Ajv(A) < /XAr(A) < (1 + e)Aiv(A). 



Indeed, 



Var 
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Now Chebyshev yields 

P{MA) - A^(A)| > 6A^(A)} < ^22nI^^^^ 
and Borel-Cantelli completes the proof. 

Since X is strictly quasi-convex, the set {X(x) = log 2} does not contain any line 
segment, we can choose a countable open base 05 such that for every A e 03 either 
An {I{x) < log2} = or A n {I{x) < log 2} 7^ 0. For instance, we could choose 03 
to be the collection all open boxes such that (i) X value at a corner point is different 
from log 2; (ii) each co-ordinate of a corner point is either rational or ±00. 

Step 3: Let I{x) > log 2. Then almost surely, sup {— liminf log/xjv(A)} as 
well as sup {— lim sup log// jv( A)} are 00. 

Since I{x) > log 2, pick Aq G 03 such that x e Aq and Ag n {X(x) < log 2} = 0. 
Then limsup log Aiv(Ao) < — iiif I{y) = —L < —log 2. Fix a > such that 
—L < —log 2 — a. Then for sufficiently large N, -^logAjv(Ao) < — log 2 — a, that 
is, A7v(Ao) < 2-^6-^°'. In other words, 2^A7v(Ao) < e'^"' for all large N. Thus by 
Step 1, a.s. eventually ij,n{Aq) = and hence the claim. 

Incidentally, this also shows the following. If K is the compact set {x : T{x) < 
log 2} then consider a bounded open box A e 03 such that K d A. Clearly, A^ is 
union of 2n many boxed from 03. The above argument shows that is a.s. even- 
tually zero for each of these 2n boxes. This shows that the sequence {/iAr} is a.s. 
eventually supported on a compact set, namely, A. 



Step 4: Let I{x) < log2. Then almost surely, sup {— liminf -i: log//iv(A)} as 

well as sup {— lim sup log //at (A)} are 

xeAeQS 

Fix A e 03 such that x e A. Then liminf log A7v(A) > - inf X(y) = -L > 
— log 2, where the last inequality is a consequence of the strict quasi-convexity of X. 
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Fix a > so that —L > — log2 + a. So for large N, -^logXNi^) > ~^og2 + a, 
that is, A7v(A) > In other words, 2^A^(A) > e^". Now use Step 2, for any 

e G (0, 1), eventually 

(1 - e)A;v(A) < ^;v(A) < (1 + e)A^(A). 

Hence by definition of LDP, we have eventually 

-J(A) < liminf i-log/iiv(A) < limsup ^ log/i^(A) < -J(A), (1.2.1) 

where as usual T{A) = inf T(x). 

From the first part of the above inequality we have, 

sup {-liminf ^log/i^(A)} < sup X(A) < J(x). (1.2.2) 

Moreover, for every A G ^ such that x G A using the right side inequality of 
ffLO) . we have 

limsup^log/iAr(A) < -X(A). 

N 

Let QSa; = {A;,. G 53 : A; > 1} be a subclass of ?B so that A^^;^ C Aj. for every k 
and rifcAfc = {x}. Then 

sup {— limsup — log/iiv(A)} > sup 2r(A) 

Ae33:a;eA N Ae^:xeA 

> sup 1(A) 

A£<B^ 

= liniX(A,) 

k 

= I{x). (1.2.3) 



The last equality follows as X is a good lower semicontinuous function (see Proposition 
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From ffT:2:2|) and ffTX3|) . it follows that 



sup {— liminf — logyU7v(A)} = sup {— limsup — logyUAr(A)} = X(x). 



Now proof of Theorem 11.2.11 is completed by appealing to Proposition 10.3.31 and 



Remark 1.2.1. The fact that T is a good rate function is essential in the above theorem 
to conclude that almost surely eventually the sequence {/iAr(ti;)} is supported on a 
compact subset of M". 

Remark 1.2.2. Observe that the strict quasi-convexity of the rate function in the above 
theorem is a technical assumption. On real line that assumption can be replaced by 
the assumption: I is strictly monotone on {x : I{x) G (0, oo)} or by the assumption 
that {x : I{x) = log 2} is a nowhere dense set. Such a condition is needed only to 

ensure that there exists a countable base as mentioned in the above proof. 

The implication of the above theorem in REM is amazing. To see this, let us 
assume that {Xn} is a sequence of probabilities on M and satisfies large deviation 
principle with a good rate function X. Let us assume also that X be strictly quasi- 
convex or satisfies any one of the conditions in remark 11.2.21 For fixed N, let us 
consider 2^ i.i.d. random variables ^(cr), 1 < a < 2^ distributed like A at. We can 
identify these 2^ many a with the elements of S^r = {+1,-1}. Let us define the 
Hamiltonian for cr G Sjv to be 



where E^j is the expectation with respect to uniform probability on the S^r space. 
Hence 



By Theorem 11.2. H the induced probabilities {^in} are a.s. eventually supported 
on a compact set. That is, for almost every u, there is a compact set K^^ such that 
{fi]\r{uj), > 1} are all supported on K^. Moreover, by previous theorem they satisfy 



observing that {^n} is eventually supported on a compact set. 



□ 



Now note that the partition function can be written as 
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LDP with a good rate function a.s. Hence to find the existence of hniTv jj logZ^iP), 
we can use Varadhan's integral lemma with any continuous function, in particular, 
f{x) = jSx on M. This will lead to the following: 

Theorem 1.2.2. // the sequence of probabilities {Xn} satisfies LDP with strictly 
quasi-convex good rate function I, then almost surely, 

hm llogZ;v(/?) = log2 - inf (px + I{x)). (1.2.4) 

N^oo JM {X(x)<log2} 

Thus in the REM, the existence of limiting free energy is just a corollary of large 
deviation principle. To get the the expression of the free energy one has to solve the 
variational formula. Hence the calculation of asymptotics of free energy reduces to 
calculation of the above infimum. 

Remark 1.2.3. In the literature [T6ll20[|42j . for REM, the Hamiltonian Hn{(t) is defined 
as V^^(cr), with ^(a) ~ A/'(0, N). In our case with the Gaussian driving distribution 
it is same as N^{a) where ^(cr) ~ A/'(0, j^). But the large deviation technique allows 
us to consider to be any continuous function of ^(cr). In other words, if f{x) is 
a continuous function on M then one can define the random Hamiltonian H]^{a) = 
A^/(^(cr)) where ^(cr) ~ Xjy. In that case, the above theorem will reduce to 

Theorem 1.2.3. If a sequence of probabilities {Xn} on M. satisfies LDP with strictly 
quasi-convex good rate function I and H^i^a) = Nf{C,{a)) where ^(cr) ~ X^ and f is 
a continuous function on M, then almost surely, 

hm ^logZ^iP) = log 2 - inf (/5/(a;) + J(a;)) . (1.2.5) 

N^oo iV {X(a;)<log2} 

Of course, the appearance of / above makes it more general, but this could be 
obtained from (11. 2. 41) by contraction principle of large deviation techniques. Different 
choices of functions / allows us to consider the Hamiltonian driven by other distribu- 
tion which can be obtained as a function of known distributions. For instance, if we 
consider f{x) = x"^ then we can get the information of the model when its Hamiltonian 
is an appropriate if A at as M{0, j^). Also we can consider several other functions, 
for which we do not know the corresponding closed form expression of the distribution 
of the Hamiltonian. For example, f{x) = xcos (10007r|x|l^l) etc. 



Remark 1.2.4. A close look at inf {f3f{x) +X(x)) suggests the following: If we 

{X{a;)<log2} 
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consider the Hamiltonian of an A^-particle system to be an odd function / of random 
variables ^(a) ~ Xjy and if the sequence {X^} satisfies LDP with a quasi-convex 
good rate function taking value at the origin, then the contribution for the limiting 
free energy lim jf log Zn{(3) comes from only that part where the function /(x) is 

A'' — >oo 

negative. More precisely, / being an odd function /(O) = 0. Since X(0) = 0, infimum 
in fll.2.5p is non-positive. So only points x where f{x) < need to be considered 
while calculating the infimum. Thus, we have, almost surely, 

hm 1 hgZ^{(3) = log 2 - inf {I{x) + , 

N^oo I\ {X(a:)<log2}- 

where < log 2}^ = {I{x) < log 2} fl {f{x) < 0}. For example, when /(x) = x 

we have the following 

Corollary 1.2.4. If f{x) = x and I{0) = 0, then almost surely, 

lim 4 log ZnIP) = log 2 - inf (I(x) + px) . 

N^oo N ° ^' ^ ° {I{x)<\og2}- 

One can see that the contribution to the free energy is only from the negative values 
of the random variable. This can be made precise as follows: Let {Xn} and {i^n} be 
two sequences of probabilities satisfying LDP with a good strictly quasi-convex rate 
functions Ii and I2 respectively so that Xi(0) = X2(0) = and Ii{x) = X2(x) for 
X < 0. Then consideration oi C,n ^ Xn or C,n ^ will lead to the same limiting free 
energy. In other words, symmetry of the random variables does not play any role in 
the evaluation of limiting free energy. To illustrate, if we consider the density of A at 
given by, 

i./fe-^^-' forx>0 
M^) = {^ , (1.2.6) 

E^Nx for X < 

then from the discussion of our next section, it will be clear that this sequence {X^} 
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satisfies LDP with rate function 



I{x) 



Y for X > 0, 
—X for X < 



and hence {1{x) < log 2} = [— log 2, v^2 log 2]. Here the distribution of Xn is of 
Gaussian form in the positive part of the real line whereas on the negative part it 

is of exponential nature. If ^at ~ Atv, then as inf (^x"^ + Px) > the above 

corollary will reduce to 

Corollary 1.2.5. If Xn has density (pN given by U.2.6\) . then almost surely, 
lim — log Ziy(p) = \og2 — inf (—x + Px). 

N^oo N {-log2<x<0} 

Hence the Gaussian part of the random variables does not contribute to the lim- 
iting free energy. Similarly, if we consider the density of Ajv to be 



K(.-Nx for X > 

M^) = { ^ , (1-2.7) 

forx<0 



then the rate function will be 



X{x) 



X for X > 



ix^ for X < 0. 



In this case Corollary 11.2.41 will reduce to 

Corollary 1.2.6. If Xn has density (pN given by ^.2.1 ), then almost surely, 



lim ^ log Zn{P) = log 2 - inf ( Jx^ + /5x 
Here the exponential nature of the random variable on the positive side does not 
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play any role. 

Remark 1.2.5. Suppose that the sequence of probabilities {\n} is supported on [0, oo) 
and satisfies LDP with a quasi-convex rate function X with X(0) = 0. For example, 
fix a number 7 > and put X(x) = x'^ for a; > and 00 for a; < is such a rate 
function. It follows from Corollary 11.2.41 that, when f{x) = x, then almost surely 

lim 4 log Zn(P) = log 2 - inf (I(x) + px) . 

N^oo N ° ^ ' {X(x)<log2}- 

As the sequence {Xn} is supported on non-negative real line, X(x) = cxd for x < 0. 
Hence {I{x) < log 2}- = {I{x) < log 2} n {f{x) < 0} = {0} and 

lim 4log^^(/5) =log2 

N—^00 iV 

almost surely. In this case, the system will not show phase transition. 

The examples given above are rather artificial. Of course, there are natural ex- 
amples of random variables ^(a) whose distributions satisfy large deviation principle 
with a good convex rate function. In the following sections, we discuss some examples. 



1.3 Distribution with exponentially decaying Tail 

In this section, we consider the driving sequence of distributions {Xn) such that for 
X > 0, Xn[—x, xY ~ e~^^'^ for some 7 > 0. 



1.3.1 Gaussian Distribution 

Our first natural example is the Gaussian distribution well studied in the literature [Ml 
[3ll[20lll2j. Let Aat be the centered Gaussian probability with variance That is, 

having density W , for —00 < x < 00. It is obvious that Aat ^ as — 00. 

The following is well known. It can also be obtained from Cramer's theorem 10.3.51 
Since the proof is simple, we give it. 

2 

Proposition 1.3.1. The sequence {Xn} satisfies LDP with rate function X = ^ on 
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Proof. Let A C M be an open interval. Let m = inf |a;|, M = sup |a;|, and qn = 
AAr(A). With this notation, note that, we have 

Qn <'2\ — / e -2 dx < e 2 dx < e ^ dx < e 2 , (1.3.1) 

V 27r J^m Jy/Nm V Nm 

with the understanding that when = 0, the last expression is ^ and 



> ^ / e-'^dx > - / e-^(ix > l^e-^^"^"-^^ , (1.3.2) 

for any < 5 < M — m. 

From above two inequalities, we can conclude that for any open interval A, the 
limit, lim -^logAAr(A) = — Once again. Proposition 10.3.31 completes the proof. 

□ 



2 

Remark 1.3.1. Note that, here I{x) = ^ is a continuous function with compact level 

sets. Not only that, it is a convex function and hence quasi-convex. 

As a consequence of Theorem II. 2. 3[ if for cr G S^v the random variables ^(a) ~ Aat 
and the Hamiltonian H]^{a) = Nf{^{cr)) with any continuous function / on M, we 
get the following: 

Corollary 1.3.2. If \n ~ A/'(0, -^), then almost surely, 

lim 1 log Z^{[3) = log 2 - inf (f3f{x) + ^ 

N^oo N x2<21og2 \ 2 

Taking f{x) = x, we will get the classical case where the Hamiltonian for 
A'"-particle system Hj^ is Gaussian with mean and variance N. Note that, 

inf ( l3x + ^) = inf ( ^ — (3x]. Let us denote the function g(x) = ^—jdx 

so that g'{x) = x — (3 and g"{x) = 1 > 0. Therefore, ai x = j3 the function g at- 
tains its infimum. So as long as /? < ^2 log 2, the inf — I3x] is attained 

0<x<v'2 log 2 V / 

ai X = [3. Moreover, as is a decreasing function on [0,/5], for [3 > \J1 log 2 the 
inf ( ^ — /5x ) is attained at x = a/2 log 2. Hence we get the following 

0<x<v'21og2 V / 
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Theorem 1.3.3. If HN{a) are independent //{O, N) , then almost surely, 



lim^logZ^(/3) = log2 + f z//3<v/2bp 



As we mentioned at the beginning, this is classical. 



1.3.2 Exponential Distribution 

Another simple but interesting example is the exponential distribution. Let Aat be 
two sided exponential probability with scale parameter j^. That is, having density 
^A^e"^'^'', for — oo < a; < oo. Once again, it is obvious that A^v ^ as A^ ^ oo. Now 
we show the following 

Proposition 1.3.4. The sequence {Xn} satisfies LDP with rate function X = \x\ on 
R. 

Proof. Let A C M be an interval. Let m = inf{|x| : x G A}, M = sup{|a;| : x G A}, 
and gjv = AAr(A). With this notation, we have 

qN = - [ e-^l^lrfx < r e-^dx > e"^™, 

2 J A J'/Nm 

and 

i-NM pN{m+5) 

qN> e-^rfx> / e-''dx> N5e-^^"'+^\ 

J Nm J Nm 

for any < S < M — m. 

From above two inequalities, we can conclude that for any interval A, the limit, 
lim -i:logA7v(A) = —m. Once again. Proposition 10.3.31 completes the proof. □ 

Remark 1.3.2. As in the Gaussian case, here also, note that, T{x) = \x\ is a convex 
continuous function with compact level sets. 

Corollary 1.3.5. If ^n{<^) independent (over a) two sided exponential variables 
with scale parameter j,, f is a continuous function on M and H]^[a) = A^/(^(cr)) then 
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almost surely, 

lim ^logZ^(/5) = log2 - inf (pf{x) + \x\) . 

N-*oo iV |i;|<21og2 

If we take f{x) = x, the Hamiltonian for A^-particle system Hn is, of course, 
two sided exponential random variables with scale parameter 1, that is have density 
|e~l^l on R. In that case, for the limiting free energy, we only need to calculate 
inf (b| + that is, inf — A simple calculation yields the following 

|a;|<21og2 0<a:<21og2 

Theorem 1.3.6. If Hn{(t) are independent two sided exponential random variables 
with scale parameter 1, then almost surely, 



lim^logZ^(/?) = log2 if 13 < I 



N 



(5\og2 if(5>l. 



Remark 1.3.3. Interesting observation in this analysis is that, the random Hamilto- 
nian, for A'"-particle system being exponential random variables with scale parameter 
1, does not depend on the number of particles. Even then, the system shows a phase 
transition. 

1.3.3 WeibuU Distribution 

A more general class that can be considered are the WeibuU distributions. Let A^v be 
the probabilities on M having density 

4'N,'y{x) = ■, — OO < X < OO. (1.3.3) 

This is known as WeibuU distribution with shape parameter 7 > and scale parameter 

(-^)^. Clearly, ^at^i is the usual two sided exponential density, considered in the 
previous subsection. We show that. 

Proposition 1.3.7. //Ajv has density ^tsi,-i, then {Ajv} satisfy LDP with rate function 

X = ^ on R. 

7 

Proof. Let A C R be an interval. Let m — inf{|x| : x e A}, M — sup{|x| : x e A}, 
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and qn = AAr(A). With this notation, we have 

/ |a;|^-ie~^— rfa; < / e-'^dx < / e"^ = e"^" dx, (1.3.4) 
JA 2 Jn^ Jn^ 

7 7 

and 

,rM7 (m + g)T 

1 r^— 1 r^—T^ 5 -, Af(!ri±^)z 

qN>7: e-'^dx > - e-'^dx > -N{m + ^(5)T"^e-^^^, (1.3.5) 

2 J nulL 2 J j^elL 2 



7 7 



for any < 5 < M — m and some 6', < ^ < 1. Mean value theorem is used here. 
The above two inequahties imply that for any interval A C M, lim log Ajv(A) = 
Thus Proposition 10.3.31 completes the proof. □ 

Remark 1.3.4. In this case, rate function X(x) = ■'^ is a continuous function with 
compact level sets. But T{x) is convex, only when 7 > 1. Note that, for < 7 < 1, 
1{x) is not convex but clearly quasi-convex and hence Theorem 11.2.21 is applicable. 

Corollary 1.3.8. If C,n{<^) (ii"^ independent (over a) having density (j)N,-y, f is a con- 
tinuous function on M and Hi\f{a) = Nf{C,N{c)) then almost surely, 

lim 1 log Z^iP) = log 2 - inf (f3f{x) + M 

Af^oo A* |x|7<7log2 y 7 

As earlier, if we take, f{x) = x, the Hamiltonian for A^-particle system 

is a two sided WeibuU distribution with shape parameter 7 and scale parameter 

i 7-1 

N 1 . In this case the problem of limiting free energy reduces to the calculation 
of inf f ^ + (3x) , that is, inf - (3a 

|x|7<log2 V -T ^ J' , V T ^ 

' ' - * 0<x<(7log2)7 

For 7 > 1, to calculate the above infimum, we imitate the Gaussian case. Let us 
denote g{x) = ^ — (3x so that g'{x) = x^~^ — (3 and g"{x) = (7 — l)x'^~^ > on 

[0, (7 log 2) 7]. So g being twice differentiable convex function, the infimum of g will 

attain where g' = 0. But g' will be on [0, (7 log 2) 7] only when f3 < (7 log 2) 7. In 

1 1 
that case, the infimum will occur at x = /?t-i. When P > (7 log 2) 7, then ^f' < on 

[0, (7 log 2) 7] and hence infimum occur at x = (7 log 2) 7. 

For 7 < 1, the function g{x) = ^ — fix = x'^ (^^ — l3x^~'^^ is a product of two 
functions. Here x"' is a positive increasing function on [0, (7 log 2)"]. On the other 
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hand, ^ — Px^ is a decreasing function taking the value ^ at 0. If this function 

always remains positive then clearly the minimum of is attained at a; = 0. On the 

other hand, if this function takes negative value in [0, (7 log 2)^] then the infimum of 

g is attained at (7 log 2) ~. This situation occurs only when - — jdx^^"' = for some x 

1 _i 1 — f 

in [0, (7 log 2) t]. This happens only when /3 > 7 T(log2) t . Hence the infimum of 
1 _i 1 — f 1 

g on [0, (7 log 2) -'J is attained at a; = for /? < 7 T(log2) t and at x — (7 log 2)')' 

_ j_ _ 1— 7 
for /3 > 7 T (log 2) T . 

We can combine the above arguments as 

Theorem 1.3.9. // {H]\f{a), a G Sat} are independent having two sided Weibull 
distribi 
surely, 



i 7-1 

distribution with shape parameter 7 > and scale parameter 'yy N -> , then almost 



1 iog2 + 3:^/?7-i z//5< (7log2)-, 

|^(7log2)^/3 ^//?> (7log2)^ 

z/7 > 1 
and 

1 log 2 < 7-7(log2)-V^ 
lim-logZ^(/3)= <^ 

N J\ I 1 1 1 — I 

(7log2)"/3 > 7~"(log2)~~ 
^/7<l• 

Remark 1.3.5. It is easy to verify that, if A^v has density. 



')N,'y{x) — Const. e t — 00 < x < 00, 



more precisely. 



(t>N,'y{x) = ,1. 7 N-ie t - 00 < X < 00 

then {Xn} satisfies LDP with rate function l{x) = ^\x\'^. Note that here 7 = 2 is 
the Gaussian distribution. Hence in REM, if we consider Hn to ^{0, N) or two sided 
Weibull distribution with shape parameter 7 = 2 and scale parameter \/2N, they 
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will produce the same limiting free energy. So the limiting free energy of REM is not 
entirely distribution specific, but it is 'rate-specific'. 



I. 4 Compact Distributions 

In the previous section we observed that, for the existence and evaluation of free 
energy, we concentrated our attention on the set {x : T{x) < log 2}. That is, the 
entire support of the random variables are not contributing to the system. To do that, 
we used a variant of Varadhan's lemma. In general Varadhan's lemma is applicable 
to the class of bounded continuous functions. In our case the functions used in the 
previous section are rather unbounded. As suggested by Proposition 10.3.41 if the 
underlying sequence of probabilities are eventually supported on a compact set, we 
can overcome this little technicality. Our assumption that the rate function X is a good 
rate function will ensure that the sequences of induced probabilities are almost surely 
eventually supported on a compact set. Since X is a good rate function {X(x) < a} 
is a compact set for every a G M. In particular, {X(x) < log 2} is a compact set. For 
example, if {\n} is a sequence of probabilities satisfying LDP with rate function X so 
that X(x) < log 2 for all x G M, then we may not be able to apply Varadhan's lemma 
to get the free energy of the system. In particular, if X(x) = for all a; G M, we 
can not infer anything about the existence of the free energy of the system by large 
deviation techniques. 

To start with, let us note that, if X takes two value and oo so that {X(a;) = 0} 
is a compact set, say C. In this case, in view of Remark ll. 2. 2^ we can apply Theorem 

II. 2.31 with f{x) = (3x. This will ensure the almost sure existence of the limiting free 

energy and is equal to log 2 — inf (3x = log 2 — Pxq, where Xq = mi{x : x G C}. Note 

c 

that, if C C [0, oo) then clearly the limiting free energy becomes negative for large (3 
if ^ C whereas if G C then it will be just a constant, log 2. So we will not get any 
phase transition here. 

Though we do not have a clear picture when X is identically 0, we have some 
partial results. First of all, note that X(x) = for all x iff lim log Aiv(A) = 

for every open subset A o/M. This follows from definition of LDP. 

Now let us consider the case, when the Hamiltonian is supported on a compact set. 
For each A^, let A at be a compactly supported symmetric probability with density (pjy 
and {^Ar(o") : o" G Sat} be independent random variables having density (p^. Consider 
the Hamiltonian 

H^{a) = mNia). 

Let [—a^, Oat] be the support of ^^r. Let us assume that ajsf a as N ^ oo. Here we 
allow the possibility that a = oo. For s > 0, denote aN^s) = P{^7v(c") > s}. Note that 
in this setup if {Xn} satisfy LDP with rate function I{x) = 0, then j^\ogaN{s) 
as — > oo for < s < a. 
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The following theorem suggests that if the tail probability does not decay expo- 
nentially fast over N, then we can not expect any annealed phase transition. 

Theorem 1.4.1. Let [~aN,C(N] be the support of C,n and for s > 0, denote aAr(s) = 
P{^n{(^) > s}. If ^ o. and log aAr(s) — > as N oo for < s < a, then 

lim ^E\ogZN{P) = hg2 + ai3. 

Proof. As 

the proof for a = is immediate. Moreover, in this case, for every sample point 

lim i-logZ;v(/3) = log2. 

N^oo iV 

So let a > (may be a = oo). Since log is concave, by Jensen's inequality 

E\ogZM{f3)<\ogEZN{f3). (1.4.1) 
As Hn are bounded by Nun, 

Hence, by assumption and fll.l.ip . 

limsup^ElogZ^(/3) <log2 + a/3. (1.4.2) 



Now we show. 



liminf ^ElogZiv(/9) >log2 + a/3. (1.4.3) 

N—>oo A 



For that, with arbitrary but fixed < s < a, let Xn = ^^{o" : HN{a) > sN}. Then 
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EXn = 2^aN{s) and EXj^ = 2^(2^ - l)a%{s) + 2^a^(s). Hence 

E{Xn - EXnY = EXl - {EXnY < 2^aiv(s). (1.4.4) 

If Xn < 2^-^aN{s) then EXn - Xn > 2^-^aN{s) so that {Xn - EX^f > 
2^^~^a%{s). Let = {Xn < 2^-ia^(s)}. So An C {{Xn-EXn)^ > 2^''-^a%{s)}. 
Hence, by Markov inequahty and ( 11.4.41) . 

P(A ^ ^ E{Xn^EXn)^ 4 



22^-2a^(s) - 2^aN{sy 
i.e., P(A^) > 1 - 21^. But on A^, 

ZN{f3) > XNe"'"" > 2''-'aN{s)e^'^, 



and hence 

E [logZ^(/3)U^] > [{N - 1) log 2 + loga^(s) + PsN] (^1 - ^w^) ■ (^-^-S) 

Now An = {Xn = 0} U {1 < X^v < 2^-ia„(s)}. Since Zjv(/5) > 2^6"^^"^ and 
PP(X^ = 0) = (1 - a7v(s))2'^ we have 

^[logZ^(/?)l|x^=o}] > {N\og2-(3NaN){l-aN{s))''"'. (1.4.6) 
On {1 < < 2^-^aN{s)}, log Zn{(3) > f3maxHN{cr) > f3sN > and hence 



E [logZ7v(/?)l{i<Xjv<2^-iaiv(s)}] > 0- 



(1.4.7) 
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Thus from f03D . ffLTO and ffTXTll we have 

+ [\og2- f3aN] {I- a^{s)f\ 



N 



By assumption, logajv(s) ^ so that 2^aAr(s) oo and hence (1 — aAr(s))^ 
as iV — >^ oo. Thus, under the assumption. 



hminf ^ElogZiv(/3) > log 2 + /3s. 



Since < s < a is arbitrary, we have 



hm inf log Zn{P) > log 2 + a/3 
N—*oo Jy 



which remain true even when a = oo with the understanding that the right side of 

the above inequality is oo. 

This completes the proof. □ 

Since has density (pN with support [—aN,aN] and Hj^ = N^j^, the support of 
will be [— T/v, T/v] where Tj^ = Na^. If we assume, v?7v be the density of Hj^ with 
support [—T/v, T/v], then we can apply the above theorem with a/v = 

The following examples will illustrate the applications of the above theorem. 

Example 1.4.1 (Uniform Distribution). Let fNix) = 2^1[-Tjv,Tjv]- If > 0, 

then a]\f{s) — » > for all < s < a. So utilizing the above theorem we get, 

I 

a) if T/v = then lim —log Zn{(3) = log 2 for every sample point, 

b) if Tn = N then hm ^E log Zn{P) = log 2 + /?, 

c) if T/v = A^^ then lim ^ElogZ^^p) = oo. 
Similar remarks follows for the other examples also. 

Example 1.4.2. Let 5 > be fixed and 

fNix) = -^^^{Tn - \x\)\ -Tn<x<Tn. 
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If ^ ^ a > 0, then ajv(s) | {^Y^^ > for all < s < a. 
Example 1.4.3. Let 



1 X ttTn ^ ^ ttTn 

09/v(x) = — — COS — — , < X < . 



If ^ ^ a > 0, then ajvi-s) ^ |(l - sin ^) > for all < s < af . 
Example 1.4.4. Let 



If ^ — > a > 0, then aAr(s) | as oo for all < s < a. 
In the following examples, above theorem is not applicable. 

Example 1.4.5 (Truncated Double Exponential). Let 



1 

2(1 ~e~T^ 



Let ^ ^ a(> 0) as iV ^ oo. Then a^(s) = Hence ^^^^^ ^ -s ^ 

as ^ oo for all s with < s < a. Thus we can not apply the Theorem 11.4.11 any 
more. However, if H^^a) has density v^Ar(x) and Aat is the law of j^Hi^{a) then if 
«(> 0), by analysis of subsection ll.3.2[ we can easily see that the sequence 
{Aat} satisfies large deviation principle with rate function X given by. 



Ik 

N 



X{x) 



\x\ for |x| < a 
oo otherwise. 



Hence we can use Theorem II. 2. 3l to conclude that the free energy will be same as that 
of exponential REM as long as a > log 2. Where as if a < log 2 then almost surely, 



1 I log 2 forO</5<l 

lim — logZjv(/3) 

'\og2-a + /3a for /3 > 1. 
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Example 1.4.6 (Truncated Gaussian). Let 

Oat 

Let ^ ^ a(> 0) as AT oo. Then i2I^^ ^ -is2 ^ o as oo for all s with 

< < a. Thus we can not apply the Theorem 11.4.11 once again. However, if 
HN{a) has density v^Ar(x) and \n is the law of j^Hi^{a) then if — > a(> 0), by 
analysis of subsection 11.3. H we can easily see that the sequence {\n} satisfy large 
deviation principle with rate function 1 given by, 



J(x) 



for < a 



oo otherwise. 



Hence we can use Theorem ll.2.3l to conclude that the free energy will be same as that 
of Gaussian REM as long as a > log 2. Where as if a < log 2 then almost surely, 



1 I log 2 + \I3^ for < /3 < V2a 

lim — logZ^(/3) 



log 2 - a + /3V2a for j3 > V2a. 



1.5 Discrete Distributions 

We conclude this chapter by considering the REM driven by some discrete distribu- 
tions. 



1.5.1 Poisson Distribution 

Let us consider the Hamiltonian for the particle system HN{a) ~ P{N9) where 
P{N9) is the Poisson distribution with parameter N9. Let Aa? be the law j^P{N9). 
We also can think of A^r as the law of the sample mean for a sample of size from 
P{9). Then by Cramer's theorem (Theorem 10.3.51) . {Xn} satisfies LDP with convex 
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good rate function T given by 

= / ^-^ + 3^1ogf forx>0 _ 

1^ oo otherwise ' v • • / 

Hence by Theorem ll.2.3[ if for a E T,n the random variables ^ (a) is distributed hke 
Aat and the Hamiltonian Hjy^cr) = Nf{C,{a)) with any continuous function / on M, 
we have the following: 

Corollary 1.5.1. If \n ~ ^^c'^ almost surely, 

hm i- log Z^iP) = log 2 - inf + J(x)) . 

N-^co iV X(x)<log2 

Notation: Note that here X is a convex continuous function on [0, oo) so that X(0) = 6; 
T{0) = and — oo as x — oo. So the set {x : X(x) = log 2} contains only one 
point when 6 < log 2; contains zero and one non-zero-point for 6 = log 2; contains two 
positive points for 6 > log 2. As a consequence, the set < log 2} is an interval 

[xi,X2]; Xi = in case of < log 2 where as Xi > in case of ^ > log 2. In any case, 
9 e (xi,X2). 

Hence when f{x) = x the above corollary implies that 

Mm log ZN{f3) =log2- inf + 6^ - x + xlogf } 

N [xi,X2] 

= \og2-e- inf {(/?- l)x + xlogf} . 

[2:1,2^2] 

To calculate the above infimum, let g{x) = (/3 — l)x + xlog | on [0, 00). Clearly, g is 
a convex function. g'{x) = P + log |, so that g'(x) = implies x = 6'e~^. Hence g at- 
tains its infimum at x = 6e~^. We consider two cases, namely, 6 < log 2 and 6 > log 2. 



I < log 2 I 

For /3 > 0, < 9e^^ < 9 < X2. That is, the point x = 6'e~'^, where 
g attains minimum, belongs to G [xi,X2] for every (3 > 0. Hence inf g{x) = 

[xi,X2] 

{p - l)9e-^ - 9e-noge-^ = -9e~'^ 



I > log 2 I 

As P increases from to 00, 9e~^ decreases from 9 to 0. Since < Xi < 6*, there 
exists Po > such that 

^e"* = xi. 

Clearly, for P < Pq, 9e~^ G [xi,X2] so that inf g{x) = —9e^^. Since g attains 

[xi,X2\ 



Chapter 1: The Random Energy Model 



34 



its infimum at x = 6e~^, g is increasing (by convexity) on (x, oo). For j3 > jSo, 
X = 6e~^ < 6e^^° = xi. Thus g is increasing on [a;i,a;2]. As a consequence, when 
P > Po, we have inf g{x) = g{xi) = [3xi +T(xi) — 9 = [3xi + log 2 — 6. 

All this leads to 

Theorem 1.5.2. Consider REM where the Hamiltonian HN{a) is Poisson with pa- 
rameter NO. 

a) For 9 < log 2, almost surely, 

lim^logZjv = log2-^ + ^e"^ /or /3 > 0. 

h) For > log 2; let Xi he the least positive solution o/x(log| — \) = 6 — log 2, 
and (3q = log Jj- = — 1. Then almost surely, 



lim^logZ^ =log2-^ + ^e-^ for P < Po 
= Pxi for P > Pq. 

Now if we take f{x) = —x, then by Corollary I1.5.H almost surely, the limiting 
free energy is given by 

lim 4 log Zn (P) = log 2 - inf {I(x) - px] 

= log 2 -61- inf (xlog^ - (p + l)x\ . 

X(x)<log2 I J 

To calculate the above infimum, let g{x) = a;log| — (/? + l)x on [0, oo) so that 
g'{x) = log |— /5 and g"{x) = ^ > on (0, oo). Hence g attains its infimum at x = 
Note that, x = 9 for P = and x — > oo as /? ^ oo. So there exist Pi > such that 
XiOe^^) = log2, that is, 9e^^ = X2- So the infimum inf jxlog f — (/9 + l)x| 

X(a:)<log2 ^ ' 

occurs at Oe^ for P < Pi and at X2 for > Pi leading to the following 

Theorem 1.5.3. In REM, if the Hamiltonian Hj~^{a) negative of Poisson with pa- 
rameter NO, then almost surely 

lim^logZ^ =\og2-e + ee'^ for P < pi 
= pX2 forp>Pi. 
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1.5.2 Binomial Distribution 

Let X]sf ~ B{N,p) where B{N,p) is the Binomial distribution with parameter p{0 < 
p < 1). Put ^jv = Observe that when p = or 1, then the Hamiltonian is 

deterministic one and uninteresting. Let be the law of Thus ^jv is nothing but 
the proportion of heads in tosses of a coin (with chance of heads p). By Cramer's 
theorem (Theorem 10.3.51) . {Xn} satisfies LDP with convex good rate function X given 
by 

, , fx log - + (1 - x) log for X G [0, 11 
T(x) = { ^ ^ & i-p L > J (^^ 5 2) 

I oo otherwise. 

Note that here X is a strictly convex continuous function. Now fix a continuous 
function / on M. Consider N particle system with Hamiltonian Hn = A^/(^jv(ct)). 
By Theorem ll.2.3[ to calculate the limiting free energy we only have to solve the 
optimization problem 

inf (/?/(x)+T(x)). 

I(z)<log 2 

Note that here X(0) = — log(l — p); 1{p) = and X(l) = — logp. So the set 
{x : I{x) = log 2} = {0,1} when p = | otherwise the set {x : X(x) = log 2} is a 
singleton. Let us denote the set {X(x) < log 2} as [xi, X2] where 0<Xi<p<X2 = l 
for p > |; = Xi < p < X2 < 1 for p < I and = Xi < p < X2 = 1 for p = |. 

When /(x) = x, by Theorem ll.2.3[ we have almost surely, 

1 1 X 1 — X 

lim — logZ7v(/3) = log2 — inf </5x + xlog — 1-(1 — x)log 

N N [xi,X2] { p 1 — p 



To calculate the above infimum, let g{x) = (3x + x log | + (1 — x) log on [0, 1] 

x{l-p) 
{l-x)p 



Clearly g is a convex function and g'{x) = f3 + log f . So g attains its infimum at 



p 

x{(3) given by x{(3) = -. — r. We now consider two cases. 

p+ [l +p)eP 



As [xi,X2] = [0,X2] where X2 > p and x(/3) < < p for every /3 > 0, we have 
x(/3) G [xi, X2] = [0, X2]. Hence on [xi, X2], g attains its infimum at x(/?) and by routine 
algebraic manipulations, we get, inf g{x) = /5x(/5) +X(x(/3)) = P — log{p+{l—p)e^). 



[xi,X2] 



Since x{(3) decreases from < p to as /5 increases from to cxd and < Xi < p, 
there exists /3o > such that 

x{l3o) = xi. 

Hence as x(/3) G [xi,X2] for (3 < I3q, g attains its infimum at x{(3) on [xi,X2] and we 
get inf g{x) = [3 — \og{p+{l—p)e^). On the other hand, for /5 > /5o, being a convex 

[xi,X2] 
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function and attains it infimum at x < Xi, g is increasing for x > x. Hence g attains 
its infimum on [xi,X2] at xi leading to inf g{x) = g{xi) = j3xi+I{xi) = /9xi + log2. 

[xi,X2] 

All this leads to 

Theorem 1.5.4. In REM, if the Hamiltonian H^la) is Binomial with parameter N 
and p, then almost surely, 

a) forp< \, 

lim log = log 2 - /? + log(p + (1 - p)e^) for f3 < 0. 

b) forp> \, 



limilogZjv =log2-/3 + log(p+(l-p)e^) for (3 < (3o 
= -pxi for (3> (3q. 

On the other hand, if we take f{x) = —x, then by Theorem 11.2.3^ almost surely, 
lim — log Zn(P) = log 2 - inf {X(x) - px} 

N N I{x)<\og2 

{X 1 — X 
X log — h (1 — x) log (3x 
p 1 — p 

where we use the same notation for xi, X2 as in the case for /(x) = x. 

To calculate the above infimum, let h{x) = x log | + (1 — x) log ^zf ~ on [0, 1] 

so that h'{x) = log^^^^ — /3 and g"{x) = ^^-^^^^ > on (0, 1). Hence g attains its 

p 

infimum at x(j3) = — ^. Note that, x(0) = p and x(j3) — > 1 as /5 — > oo. So 

p + [1 — p)e P 

the infimum inf h{x) is attained at x_{(3) for every /3 > for p > I and for 13 < Pi 

for p < \, where /?i > is such that X(x(/3i)) = log 2, that is, x(/3i) = X2- For p < \ 
and P > Pi the infimum inf h{x) is attained at at X2- To be more precise, we get 

[xi,X2] 



the following: 



Theorem 1.5.5. In REM, if the Hamiltonian Hn{(t) is negative Binomial random 
variable with parameter N and p, then almost surely 
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a) forp> \, 

lim ^ log = log 2 + /5 + log(p + (1 - p)e"^) /or /9 > 0. 

b) forp< \, 

lim^logZjv =log2 + /3 + log(p+(l-p)e-^) for P < fii 

= 13x2 for (3 > Pi. 
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Chapter 2 



The Generalized Random Energy 
Model 



In the random energy model (REM) [T^fTB] of Derrida, the Hamiltonians in dis- 
tinct configurations are independent. The idea in generahzed random energy model 
(GREM) is to bring an amount of dependence in the Hamiltonians through the struc- 
ture of configurations. Of course, very little can be achieved by assuming an arbitrary 
covariance matrix. To introduce hierarchy, an n-level tree structure was suggested by 
Derrida [T7], where the branches of the tree are in correspondence with the configu- 
ration space. In this chapter we discuss this model with some modifications. There 
are two essential differences from what is usually considered in the literature. First, 
we provide a general framework of trees. However, they will be considered in the 
next chapter. Second, we split the number of particles N into n groups rather than 
splitting the number of spins (or 'factorizing' 2 as is customary in the literature). This 
allows us to introduce a further randomization at the tree level, like Poisson trees and 
multinomial trees. These will be consider in chapter 3. 

In this chapter, we specialize to the driving distributions having exponentially 
decaying tails. The basic inequalities lead to the large deviation principle (LDP) for 
the random probabilities as in the case of REM considered in the previous chapter. 
This leads to an explicit formula for the free energy. For the exponential GREM, 
the driving distribution does not depend on the number of particles. This does not 
make it less interesting. In fact, the Gaussian case is no more complicated than the 
exponential case. The present treatment clearly brings out the similarities between 
the two cases. There are dissimilarities too. As expected, for small values of (3 (inverse 
temperature), the energy function in the exponential case does not depend on (3 where 
as for the Gaussian it is quadratic in (3. In the Gaussian case, all the weights associated 
with all the levels of the tree participate in the expression for free energy, where as in 
the exponential case it is not always so. 

Even though for any finite number of particles, we have a truly n level tree, in 
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the limit, it may collapse to a lower level tree - it may even correspond to REM 
(see remarks 12.6.11 and I2.6.5p . This leads to the notion of reduced GREM. For such 
models, the energy function determines all the parameters of the model. It is also 
possible to characterize the energy functions. It is interesting to note that in the 
SK-model, subject to certain moment conditions of the underlying distribution, the 
energy function is universal [9], while it is not true here. 



2.1 Derrida's Model 

Let us first describe the model in detail. As a generalization to his REM [16], in 
GREM [17] Derrida introduced a tree-like structure in the energy levels. This is 
what we now explain. Fix a positive integer n > 1. This n will be the level of the 
tree. For each level z = 1, 2, ■ ■ ■ , of the tree, fix number aj so that G (1, 2) and 
nr=i '^i ~ 2- For fixed A^, in the tree, there will be many nodes at the first level. 
Below each of the first level nodes, there will be many nodes in the 2nd level. 
Hence, there will be a total of (aia2)^ many nodes at the 2nd level. In general, at 
the ith level, there will be af many nodes below each of the {i — l)th level nodes 
giving a total (aia2 ■ ■ ■ many nodes in the ith level. So at the n-th, that is, 
last level there will be (aia2 ■ ■ 'Ctn)^ = 2^ many nodes (leaves). Derrida associates 
the configuration space S jy with the all possible branches from root to leaves of the 
above tree. Since there are 2^ many configurations, he assumes nr=i ~ 2- 
define the Hamiltonian, he associates an independent random variable to each edge 
of the tree. For i = 1, 2, ■ ■ ■ , n there are (aia2 ■ ■ ■ ctj)^ independent Gaussian mean 
zero random variables with variance aiN associated to each of the ith level edges. 
Here ai, 02, ■ ■ ■ , a„ are positive numbers so that ^27=1 = 1- The Hamiltonian for 
a configuration, that is, for a branch from root to a leaf is the sum of the n random 
variables associated with the edges constituting the branch. So the partition function, 
in this model, reduces to 



n=l j2={n-l)a^+l j2={in-i-l)a^+l 



In the entire explanation above, we pretended that each af is an integer. But is 
this possible? - No. One way out is to consider [af]. Being the number of edges, 
each af has to be an integer which divides 2^ because (aia2 ■ ■ -(yn)^ = 2^. By the 
fundamental theorem of arithmetics, = 2'^'^*'^^ for some positive integer k(i,N). 

Moreover, k{i, N) fori = !,■■■ , n is such that k{l, N)+k{2, N)^ ^k{n, N) = N. In 

other words: given any tree with 2^ leaves the construction allows only for furcations 
in powers of 2 at each layer. This was also noted in [13]. To eliminate the confusion 
regarding whether is an integer or not, we made the natural modification to the 
model in the next section. 



41 



A Reformulation 



2.2 A Reformulation 

Wc formulate GREM as follows. As above, fix an integer n > 1. Let N > n he 
the number of particles, each of which can have two states/spins +1,-1; so that 
the configuration space is 2^. Consider a partition of N, into integers k{i, N) for 
1 < i < n with each k{i,N) > 1 and '^k{i,N) — N. The configuration space 2^, 

i 

naturally splits into product, Yl 2*^*^*'^^ and a E 2^ can be written as aia2 • • • (t„ with 
(Tj e 2'^(*'-^^. An obvious tree structure can be brought in the configuration space. 
As earlier imagine an n-level tree. There are 2^*^^'^^ nodes at the first level. These 
will be denoted as ci, with ai G 2^^*^^'^^ Below each of the first level nodes there are 
2*^(2.-^) nodes at the second level. The second level nodes below cti of the first level 
will be denoted by (Ti(T2 with (T2 G 2^^'^'^\ In general, below a node 0102 ■ ■ ■ of 
the (i — l)-th level there are 2'^(*'^) nodes at the i-th level denoted by 0^02 ■ ■ -ai^iUi 
for cTj e 2*^*^*'^). Thus a typical branch of the tree reads like (Ji(72 • • • cr^. Obviously the 
branches are in one one correspondence with 2^, the configuration space. At the node 
(Ji • • • (jj, we place a random variables ^(cti • • • CTj). We assume that all these random 
variables are i.i.d. with a symmetric distribution. We associate one weight for each 
level, say weight Oj > for the i-th level. These are not random. In a configuration 
a — (7i - • ■ (Tn the Hamiltonian is 

n 

HN{(j)^Y.ai^{a^---ai). (2.2.1) 

i=l 

For /3 > the partition function is 

Z^{(3) = e-^^~(^) = 2^E^e-'^^^(^). (2.2.2) 

a 

Here stands for expectation with respect to cr when the configuration space 2^ 
has uniform distribution. In other words, Eq- is simply the usual average over a. 

Since ^'s are random variables both and Zn are random variables. We suppress 
the parameter uj. As usual j^\ogZ]si{l3) is the free energy of the iV-particle system. 
This is the object of study. As N changes, the common distribution of the ^'s would 
in general change and so in ifjv- 



2.3 Tree Formulation 

We now reformulate the setup as a general tree structure. Though most of the trees 

that we consider later are regular in the sense that the number of furcations 

of a node depend only on its level, and not on the particular node the present 

formulation is general. It allows randomization of the tree, which we do consider later 
in the next chapter. 
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Let n > 1 be fixed integer as earlier. For each N > n, let T^r be a tree of height n 
with each branch extending up to the n-th level, ai denotes a typical node at the first 
level and in general below a node cricr2 ■ ■ ■ o'j-i of the (i — l)-th level, (Ti(T2 - ■ ■ (^j-iCTj is 
a typical node at the i-th level. We shall now define some useful quantities associated 
with the tree. Let S^v be the set of all branches (Ji(T2 • • • (7„ of the tree T^. Let 
denote the number of nodes at the i-th level. In particular, is the total number 
of branches of the tree, which will simply be denoted hj B^. For a node (Ti(T2 ■ ■ ■ o"j of 
the i-th level, let e((Ticr2 ■ ■ ■ denote the number of nodes at the n-th level below the 
node aia2 ■ ■ ■ (Ti. Equivalently, e(cri(T2 ■ ■ ■ cTj) is the total number of branches extending 
aia2 ■■■Oi. Clearly, ^ e[ai ■■ - ai) = Bn for any i. Let sf]^ = Yl e^{(Ti - ■■ ai). 

Assume that ■ ■ ■ cij) is a symmetric random variable associated with node 
(Ti (J2 ••• CTj. We assume that these random variables are i.i.d. Strictly speaking we 
should be using superscript N for the nodes, random variables etc. But for ease in 
reading we suppress the superscript. This should be borne in mind. We do assume 
that all our random variables are defined on one probability space. Consider the map 
Ejv ^ M.'' defined by 

(7 ^ = (C(o-i), C(o-it^2), • • • , ^{(Ti ■ ■ ■ (T„)). 

Let fj^N be the induced probability on R" when Ejv has uniform distribution, that 
is, each a eT^n has probability In other words, for any Borel set A C R", 

I,n{A) = ^i^{a : e A}. 
Un 

In particular, if ^4 is a box, say A = Ai x • • • x A„, with each Aj C R, then 

1 " 

<C7l---(T„> i=l 

Denote QiN = -P(C ^ ^i) for 1 < i < n. Since all the ^(cxi ■ ■ - ai) (for fixed N) 
are i.i.d., we did not use suffix for ^ in defining q^N- However since the common 
distribution will in general change with N, q^N would in general depend on N. Then 

E//jv(A) = qiNq2N • • ■ QnN- (2.3.1) 

Here now is the basic result. 

Theorem 2.3.1. Lei A = Ai x • • • x A„ c R". Denote qiN = P(C e Aj) for 
l<i<n. 
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//" Tl, BiNqiN ■ ■ ■ QiN < oo, for some i,l < i < n then a.s. eventually, 

N>n 



/xjv(A) = 0. 



2 

b) If for alii = 1, ■ ■ ■ ,n, ^ < oo, then for any e > a.s. eventually, 

N>n * 

(1 - e)E/i^(A) < fiN{A) < (1 + e)E/i^(A). 

In proving the first part of tlie tlieorem we will use the idea of Dorlas and Dukes 
T9] . where as for the last part, we follow Capocaccia et a/ [8]. 



Proof, a) Let jo be such that J2 Bj^^qiN ■ ■ -qj^N < oo. Then 

N>1 



M^) = 7^ E fliA,(e(^i---o^.)) 

(71— an i=l 



1 30 

< ^ E n lA,(^(c^i---c^i))e((Ti---ajJ 

DN cTi-ay. i=l 



= Gn, (say). 
Let be the event {G^ = 0}. Observe that 



JO 



[^(Tl---0-jQ 1 = 1 

Now by Chebyshev's inequality, 

jo 

ai-ajg 1=1 

Thus by assumption and Borel-Cantelli, Ajsr will occur a.s. eventually, i.e. G^ = 
and hence (A) = 0. 
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b) We first get an estimate for the variance of /iAr(A). 



mr(/iAr(A)) 



d2 S 



B 



E n lA.(e(c^i ■ ■ ■ ^i))lA,(e(n • ■ ■ r,)) - g?^ ■ ■ ■ 



< E E EniA,(e(^i---^.)) n iA.(e(^i---^,,))iA.(e(ri 



< 



1 n J 

1 " 



i=j + l 



N j=l 



2 2 2 

-g2" 2^^ ■ ■ ■ QjN'l(j+i)N ' ' ' InN^jN 



N j=l 



Hence for any e > 0, by Chebyshev's inequality and fl2.3.ip 



1 " 

•(|/i^(A) -E/i^(A)| > eE/i^(A)) < ^j^Yl 



s2 



n)) 



^ llN ■ ■ ■ QjN 

But, in view of the assumption, the sum over N of the right side is finite. So by 
Borel-Cantelh lemma, a.s. eventually, 

(1 - e)EfiN{A) < fiN{A) < (1 + e)EfiN{A). 



□ 

For GREM type regular trees the condition above will simplify as follows. This 
result is in [19] though not exphcitly stated. 



Corollary 2.3.2. Let k{i,N), 1 < i < n be positive integers with Y2k{i,N) = N. 

i 

Suppose that the tree has 2'^'^*'^^ nodes of the i-th level below each node of the (i-l)-th 
level. 

^) U E 2'^'^^'^^"' ^''^^'^^qiN ■ ■ ■ QiN < oo, for some i,l < i < n, then a.s. eventu- 

N>n 

ally, /iAr(A) = 0. 

^) U E 2^^''^^'^^'^""^''^'^''^^^ q^^ ■ ■ ■ < oo, for each i = 1, ■ ■ ■ ,n, then for any 

N>n 
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e > 0, a.s. eventually, 

(1 - e)qiN ■ ■ ■ QnN < /iiv(A) < (1 + e)qiN ■ ■ ■ QnN- 

2.4 Exponentially Decaying Driving Distributions 

We fix a number 7 > 0. In this section we consider an n level GREM where for the 
particle system the random variables C,{o'i ■ ■ ■ (Ji) are i.i.d. having probability density 

(f^Nai^) = Const. e -^JVT^ — 00 < X < 00, 

More precisely, 

1 /7\^ _ i-r 
<pNr<{x) = [j^) e - 00 < a; < 00. (2.4.1) 

Note that is independent of and is two sided exponential density with 
parameter 1. On the other hand, (f)N,2 is Gaussian density with mean and variance 
A^. Of course, 7 can be larger than 2 as well. 

If we define the map S^v = Ui 2''^''^^ ^ by 



AT ' AT ' ' N 



and transport the uniform probability of S to M", we get a probability fiN{^^) on M". 
In evaluating the free energy, we will be applying Varadhan's lemma (Proposition 
10.3.41) . This explains the factor in the above map, which was not present in the 
general framework of Theorem 12.3.11 

Let A = Ai X ■ ■ ■ X A„ be a non-empty open rectangle of M". For such A 
and 1 < i < n define rrii = inf |x| and Mj = sup |a;|. Clearly, mi < 00 for all i. 

Observe that in case > then Aj C (— Mj, —nii) U {mi, Mi) and in case mj = 
then Aj C [—Mi, Mi). In any case Aj C (— Mj, — m^] U [mi, Mi) for each i. Let 
m = (mi, ■ ■ ■ ,mn)- Also define gjjv = P{j^ & Aj), for 1 < i < n. 

First let us assume that 7 > 1. Let J C M be an interval. Denote m = inf Ixl and 

M = sup \x\. Denote g^r = ^ J)- With these notations, we have the following 
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two observations: 



1 f°° 



7-1 

< ^ ^ / e-^dx 



7-1 

j-e T 



r(i)(iVm^) — 

with the understanding that when m = 0, the last expression is 1 and 



(2.4.2) 



7-1 I-NM 



Nm 



1 / T 7-1 



> ^^T^TT / , — e-^dx (2.4.3) 

' 7 



for any < 5 < ^(M^ - m?). 

Now let 'y < 1. J,m,M as above except that J is now assumed to be bounded 
interval of M so that < m < M < oo. With as earlier, we have 



1 t"^ 

1/7 7-1 



7-1 pNM 
Nm 



X T e ^(ix 



I 7 
7-1 

< ^-^^ 7^ / e-^dx 

7-1 

7 ^ 



(2.4.4) 



r(i)(iVM7)^ 



with the understanding that when m = 0, the last expression is 1. The difference 
between ( 12.4.41) and (12.4.21) is just that in the penultimate inequality the lower bound 
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of the integral appeared in fl2.4.2p where as in (12.4.41) . the upper bound of the integral 
appeared. 



7-1 pNM 



^ P ( i . ./) > (^) ' L '""^"^ 

i / ^ 7-1 _ 

> „.i . / X T e ^dx 



— 7TT / X T e dx O 

(TV 1^+5) 



1 7 



7^ 

> —e 

2r(i)(iVm7)- 



for any < (5 < ^(M'^ - m"^). The difference between fl2X5D and fl2X3D is just 



that in the penultimate inequality the upper limit of the integral appeared in (12.4.31) 
where as in (12.4.51) . the upper limit of the integral to bound and lower limit to 

7-1 

bound X T is used. Moreover, when m = the lower bound for g^v can be given by 
^1, X ' e~^dx for < 5i < 82 < —■ As earlier, this bound does not depend 
on A^. 

From now on we assume that ^ for 1 < i < n with pi > 0. Clearly, 

Y,Pi = l- Let 

^ = {Jg M" : < ^j9ilog2, l<A;<n}. (2.4.6) 

i=i 1=1 

Proposition 2.4.1. =^ 60 a.s. as N —>■ 00. 

Proof. For any e > 0, define A(e) = [— e, e] x ■ • ■ x [— e, e] C M". By Markov inequality, 

P(/x;v(A'=(e)) > e) < -E/i^(A^(6)) < > eN) < — P(e > eN) < — C^e'^T, 

e e e e 

where Cat can be obtained from (12.4.21) for 7 > 1 and from (I2.4.4p for < 7 < 1. Since 
log Civ — as — » 00, the proposition follows from the Borel-Cantelli lemma. □ 

Proposition 2.4.2. // A fl \E' = 0, then a.s. eventually yUAr(A) = 0. Moreover, the 
sequence {^j^n} is supported on a compact set. 

Proof. A n = implies m ^ \E'. This is seen as follows. By definition of rrii, either 
rrii or —rrii is in Aj. Thus for each i, there is an = ±1 such that e^mj G Aj. Thus 
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the vector (eimi, ■ ■ ■ , enmn) € A and hence ^ By the symmetry of m ^ \1/ as 
well. As a consequence, for some j , 1 < j < n, 



J 7 J 



E— >EP^l°g2. (2.4.7) 

i=l i=l 

For 7 > 1 using fl2.4.2p and for < 7 < 1 using fl2.4.4p we can say that qny < 
CiN^^' ~ where 4 logCjAr as 00 for 1 < i < j. Hence as a consequence of 



N 

(12.4.71) and the fact ^ Pi, we have 



ATM Af>l 



Thus by Corollary [2321 a.s. eventually fiN (A) = 0. 

To see the last statement of the Proposition, fix any 6 > 0. Let J be the compact 
set [— log 2 — 6, + log2 + 5]". Since the complement of this set is union of 2" open 
rectangles of M", each of whose closures are disjoint with \E', the earlier part implies 
that eventually fiN^J) = 1. □ 

Proposition 2.4.3. // (A fl \E')° 7^ (j), then for any e > a.s. eventually 

(1 - e)qiN ■■■qnN < yU7v(A) < (1 + e)giAr ■ ■ ■ qnN- 

Proof. The assumption (A fl 7^ 'P implies fh G . Indeed, since (A fl \t')'^ 7^ 
pick (xi, ■ ■ ■ , Xn) G (A n ^)°. By symmetry of ^ , ■ ■ ■ , |a;„|) G as well, and 

now < mj < for all i yields (mi, ■ ■ ■ , m„) G . 

We are going to show that the hypothesis of Corollary 12.3.2( b) holds. Fix z, 
1 < i < n. Using (I2.4.3P for 7 > 1 and using (12.4.51) for 7 < 1, we can say that 

1 

qjN > CjNe~^~~^^ for 1 < j < n with sufficiently small 6 > 0. Thus 



r,-(fc(l,JV) + -+fc(i,7V)) -1 _ _ 1 ^ 

^ qiN HiN ^ ^ 
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Since m is an interior point of there is an a > such that Yl Pj log 2 — Yl ~ ^ 

j=i i=i 

a. Now use the fact that _^ and ^logCjAr ^ as — > oo to deduce 

that eventually Yl ( ^%r^ ^og2 — — + jj: log CiN) > «• Making S > smaller, if 
i=i V / 

necessary, assume that eventually Y (^^%r^ log 2 ~ — + ;^ logCjAr") — i5 > a. Hence, 

.7 = 1 V / 



eventually e 



E(^log2-^ + ^logC,iv)-i5 



< e As a consequence. 



7V>1 



r,-(fc(l,Ar)+...+fc(i,7V)) -1 . . .^-1 / 



Hence by Corollary 12.3. 2[ the proposition follows. □ 

Remark 2.4.1. (A fl ^ implies in particular, that pi > 0. In fact, '^^ ^ (p iff 
Pi > 0. 

Now, we have the following. 
Proposition 2.4.4. For a.e. sample point uj, 

lim llog/i^(A) = -Yf ^f{^n^ry^<P 

= -oo if An = (f). 

Proof. When A fl \E' = 0, the result is immediate from Proposition I2.4.2[ 

Assume that (An^)° 7^ 0. Fix any e, < e < 1. Let 7 > 1. By (|2A2|) . ^ log qiN < 

T 7 

log CiN - ^ where ^logC^Ar as A^ ^ 00. Hence lim sup ^ log g^Ar < 
Similarly, by using (12.4.31) we get liminf log qiN > Thus lim log qiN exists and 
equals to ^ for each i. The same holds even if < 7 < 1, where we need to use 
(12.4.41) and (I2.4.5p . Then by proposition 12.4.31 we have a.s eventually, a.s. eventually 



(1 - e)giAr ■ ■ ■ g„Ar < /iAr(A) < (1 + e)qiN ■ ■ ■ qnN- 

1 Tfl^ 

So by taking logarithms and using lim — logg^Ar = for each i we get the 

N-*oo N 7 

proposition. □ 
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Let us consider the map J : ^ M, defined as follows, 

n 

HI) = igNr .fie* 

= oo otherwise. 

Theorem 2.4.5. Almost surely, the sequence {^n} satisfies LDP with the rate func- 
tion I . 

Proof. Let A be the collection of all rectangles A = Ai x ■ • • x A„ C M" such that 
each Aj is a bounded interval with rational endpoints and either A fl = or 

(An^)V0- 

It is easy to check that A forms a base for the usual topology of M". For A E A, 
by Proposition 12.4.41 the limit, — lim — log/iAr(A) exists almost surely. Denote this 
limit by La- Since ^ is a countable family, out side a null set, these limits are well 
defined for all A G 

In view of Proposition 10.3.31 to complete the proof, we show that for x G M", 

I{x) = sup La. (2.4.9) 
If 5; ^ \E', clearly sup La = oo = I{x). 

Now consider, ) G Suppose xgAg^. IfA = AiX---x Aj^ 

with rui = inf \y\, then rrii < IxA and hence — < Therefore, by Proposition 

Eia = E f < E Thus 

i=l i=l 

sup La < I{x). (2.4.10) 

On the other hand, consider e > so that e < for any i with Xi ^ 0. Let A be 
the box with sides Aj = (xj — e,Xi + e). By choice of e, nii = inf |?/| equals \xi ± e| 
depending on the sign of Xj. Of course, if Xj = then = 0. Thus for the A so 
constructed, we have. La = E ^^'^"^^^ • This being true for all sufficiently small e, 

{i:xi^0} 
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we conclude that 



sup L^>J2 = Hx) (2.4.11) 



(12.4. lOp and (12.4.111) complete the proof of (12.4.91) thus completing the proof of the 
theorem. □ 

We shall now proceed towards an expression for the free energy. Denoting /(x) = 

n 

4 = 1 



limilogZjv(/?) = log2 + limilogE^e-^/ 



log2-inf{E/?a,x, + E^}- 



by Proposition 10.3.41 This last infimum equals inf V (— f3aiXi]. Since (3 > 0, 

flj > it is easy to see that the above infimum is attained when all the Xi are negative. 
In other words, by symmetry of \E', the infimum is attained at a point —x for some 
X G = \& n {x : > for 1 < i < n}. Thus 



1 " / 7 

hm - log Z^{f3) = log 2 - inf ^ - 

i=l ^ 



1^ Ctj^X'^ 



In this way, for the above mentioned class of driving distributions, the free energy 
exists almost surely and is a constant. Not only that, finding an explicit formula for 
the free energy reduces to calculating the above infimum. 

Remark 2.4.2. It is also worth noting that the LDP holds good even when the driving 
distributions at various levels are different. To be more specific, let us fix n numbers 
7i, ■ ■ ■ ,7„; each greater than zero and consider an n level GREM where the driving 
distribution at the i-th level is 4>N,'yi- More precisely, for any node ai ■ ■ ■ a, at the i-th 
level ^((Ji ■ ■ ■ Cj) has density 0Ar,^.. Of course, all the random variables are independent. 
Define as earlier, the map T^n by 



a 



^(o"i) ^{(^1(^2) ^(cTi-'-CTr, 



AT ' AT ' ' N 



Let fiN he the induced probability on when S^r is equipped with uniform proba- 
bility. The same arguments as above, with = P {^^^^^jj^ G Aj j , will show that 
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almost surely, the sequence of probabilities {hn, N > n} on M" satisfies LDP. In this 
case, with rate function / will be given by 



i=l 



(2.4.12) 



oo otherwise, 



where 



k , 1^,, k 



^ = {xG M" : < ^Pilog2, l<k<n}, (2.4.13) 



1=1 1=1 



with Pi = lim Let, as earlier, be the part of "i! in the positive orthant of 

M". As a consequence of all this, we have the following: 



Theorem 2.4.6. If the driving distribution has density 0Af,7i O't the i-th level, we have 
almost surely, 

hm — log Zjv(/3) = log 2 - inf < V — - paiXi 
N N xe^+ ^-^ \ 7,- 



i=l 



2.5 Inside Out 

A close observation of the above discussion reveals the following cute idea. Though 
the identification, at first glance, will look like very simple, its implication in GREM 
will be understood through the rest of this chapter. 

Let for each j, 1 < j < n, we have a sequence of probabilities {A]y, > ri} on M 
which obey LDP with a strictly quasi-convex continuous good rate function Xj. That 
is, Xj has compact level sets and for any two distinct points x and y in {0 < Xj < oo} 
we have Xj{6x + {1 — 6)y) < max{X j{x),Xj{y)} for any 6 with < 6 < 1. For the 
sake of simplicity, we will also assume that Xj{0) = 0. The assumption of strict 
quasi-convexity is purely a technical assumption and this can be replaced by similar 
other conditions also. For example, one can replace this by requiring that the set 
{x : Xj{x) = a} is a nowhere dense set for every a > 0. We mentioned this condition 
in Remark 11.2.21 but there we demanded this only for a = log 2. Now, let us denote 
{+1,-1}^ by Siv- For each A^, let k{l, N), . . . , k{n, N) be non-negative integers 
adding to A^ and put Sjat = {+1, — 1}'^(-''^). Clearly, Sat = Siat x T.2n x ■ ■ ■ x S^at and 
we express a G S^v as (TiO"2 ■ ■ ■ cr„ with cTj G Sjat, in an obvious way. Suppose for fixed 
A^, we have a bunch of independent random variables as follows: {^(o"i) : cxi G T,in} 
having distributions Ajy, {^(cricr2) : ^2 G S2Ar,cri G Siat} having distributions and 
in general {^(o"icr2 • ■ ■ <^j-i(^j) ■ ^-j ^ '^jN, ■ ■ ■ , cxi G Siat} having distribution A]^. 
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Define for eacli u, Hn{^^) to be tlie empirical measure on M", namely, 
wliere S (x) denotes tlie point mass at x G M". 

Theorem 2.5.1. Suppose ^ y. q jor 1 < j < n. Then for a.e. u, the 

sequence {fi]\f{uj), N > n} satisfies LDP with rate function J given as follows: 

j j 

Supp{J) = {(xi, ■■■ ,Xn) ■■ J2 ^k{xk) <YjPk log 2 for l<j<n} 

k=l k=l 

and 

n 

Ji^) = Yl 1k{xk) if X e Supp{J) 

k=l 

= oo otherwise. 

Proof. In wliat follows A denotes a box in with sides A^; 1 < j < n where each 
Aj is an interval. The proof consists of the following steps. The steps are executed 
one by one as in Propositions 12.4.21 to 12. 4. 4^ so will not be repeated here. 
Step 1: //An Supp{J) = 0, then a.s. eventually /in (A) = 0. 

Step 2: // (A fl Supp{J))^ ^ (j), then for any e > a.s. eventually 

n n 

(1 - e) n Aiv(A.) < /i^(A) < (1 + e) n Aiv(A.). 

i=l i=l 



if (A n Supp{j)f ^ <p 
if An Supp{j) = 0, 

where Ii{Ai) = inf{X(2;) : x G Aj}. 

To conclude the proof we use the idea of Theorem 12.4.51 □ 

We note that continuity of the rate functions Xj is not necessary, but then one 
needs to go through limsup and liminf of j^log fi^iA) an in Theorem 11.2.11 instead 



Step 3: For a.e. sample point cu, 



lim — log/i7v(A) 

N^oo iV 



i=l 



— CO 
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of limits which we used above. 

The imphcations of the above theorem for GREM [17] are clear. For fixed A^, and 
0" G Stv one defines the Hamiltonian 

n 

H^ia) = NJ2(^i^i^l■■■^i)■ 
i=l 

Here a^, 1 < i < n are positive numbers called weights. In the Gaussian case, it is 
customary to take Yl^l ~ though it is not a mathematical necessity. As earlier, 
Zn{I3) = e'^^^^"^^ ■ Special choices of X% lead to all the known models consid- 
ered. Centered Gaussian were consider in [171EII191E9]. More general distributions 
as well as the cases when some pj are zero were considered in [29]. Moreover one 
could take different distributions for different values of j, see §2. 71 for some interesting 
consequences. Thus the main problem of GREM is reduced to a variational problem. 
Note that, if n = 1, GREM reduces to REM. 



2.6 The Variational Problem 

In this section, we derive explicit formulae for the free energy. We return back to the 
driving distribution given by (12.4.11) . namely, having density 

<I^N,yix) = — y- i^- ) e - OO < X < oo. (2.6.1) 



'7' 



We now consider the model with same driving distributions at different levels. In this 
setup, we need to calculate the infimum 

M y(^- f3a,x, (2.6.2) 

in order to get an explicit formula for the limiting free energy. Note that, putting 
7i = 7 got all i in (12.4. 13p we get 



^ = {J G M" : ^ < ^7Pilog2, 1 < A; < n} (2.6.3) 

1=1 i=l 

and as usual is the part of \E' in the positive orthant of M". 
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2.6.1 7>1 

First let us assume 7 > 1. To evaluate the infimum let us put, ior 1 < j < k < n, 



By,,).|(|^i^^i?5)ligl| . (2.6.4) 

+ ■■■+<'*"' 

Set To = and for Z > (integer), 

Pi+i = mm B{ri + l,k) n+i = max{i > n : B{ri + 1, i) = A+i}- (2.6.5) 

k>ri 

Clearly, for some K with 1 < K < n, we have tk — n. Put (3o — ^ and Pk+i — 00, 
so that = /3o < A < /52 ■ ■ ■ < Pk < Pk+i = 00. 

Fix i < K and let P G (/3j, Define x G ^l'^ as follows: 

= (A«i)~ if e + 1, ■ • • , n} for some /, 1 < / < j (2 6 6) 

= {(3ai)^ if i >rj + l. 



Claim: inf > I — — f3aiXi ) occurs at x. 



In order to prove the claim, fix any x e '^^ . For k < j (recall that j < K was 
fixed above), first note that, by Holder's inequality, 

1 7-1 

rk / 'rfc \ 7 / '■■fc \ 7 '"fc 

i=l \i=l / \i=l / i=l 

^ I'k rk 

where the last inequality follows from the facts a; e and ^^x] = ^ jpi log 2 so 

i=l 1=1 

rk rk rk 

that ^x] <J2 IPi log 2 = E ^7- 

'■■fc 



Hence, ^ (xj — Xi) > 0. 

/3 



Since (3 > Pj, we have (^ — 1) > for 1 < / < j. Moreover since Pi are increasing 
with /, these number (J- — 1) are decreasing. It follows that. 
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In other words, using the definition of Xj, 

'^PttiiXi-Xi) >^x]~^(Xi - Xi). (2.6.7) 



1=1 1=1 



Now 



t{f-'^'"--)-t{J-^-'- 

^ E (f + - - ^) by mm 



(2.6.8) 



i=l 

>0. 



7 7 



where in the last inequahty we used XixJ ^ < -xj + -^x] . 

7 

On the other hand, utihzing the definition of Xi and the inequahty jSaiXi < ^ + 

1 7 

^^{Ptti)^ we have. 



7 



i=rj+l ^ ^ i=r j+1 ^ ^ 

A /x^ 7-1 7 \ (2.6.9) 

= E U + ^(/?a.)^-/3a.x. 

>0. 

Clearly, fl2.6.8p and fl2.6.9p complete proof of the claim. This argument is in fact a 
generahzation of Dorlas & Dukes [TH], Capocaccia et. al. [S]. 

All this leads to the following explicit formula for the free energy. 

Theorem 2.6.1. For GREM with driving distribution having density (f)N,'y o-s defined 
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in ^2. 6. almost surely, 



1 n ^ n j 1 ri 7 



i = l i=T-,_i + l 

^f f3,<P<(3j+,,0<j<K-l 



Observe that for 7 = 2, that is when the driving distribution is Normal, with 
proper identification of parameters this is essentially the same formula as in [8|[T9]. 
In defining the Capocaccia et. al. use a variant in [8J (§3.2). In defining r^, 
Dorlas and Dukes |19] consider the least index, where as recall that, we define r/+i as 
max{i > r/ : B{ri + 1, i) = A+i}- This makes no difference because 'nothing happens' 
in between these two indices. This follows from the fact that if > 0, 6j > for 
z = 1, 2, 3 and = -^i±^4^ < -^i±^, then this will imply that ^ = -^4^ < ^. So 

' ' ai 01+02+03 01+02' oi 02+03 02 

defining r;+i as min{i > r; : B{ri + l,i) = A+i}? when {i > ri : B{ri + l,i) = A+i} 
is not a singleton set, /5;+2 will be same as A+i. And this will continue until the 
maximum index of the set {i > ri : B{ri + l,i) = A+i} is attained. 

Moreover, the weights in Dorlas and Dukes [19] are incorporated in the density, 
there was no need to assume = 1, their parameter J can be incorporated in the 
weights. In fact, there is one benefit of putting the weights in the density. The large 
deviation technique will easily allow us to consider variable weights ajjy depending 
on at the i-ih level instead of a constant weights a^. For instance, let ajjv > for 
all 1 < i < n and > 1 be the weights of the i-th level for the A^ particle system 
with aiN — as A^ ^ 00. When the weights did not depend on A^, they were not 
brought in the large deviation argument. The free energy was 

" / 2 \ 
log 2- inf V(^-/5a.x,), (2.6.10) 

where 

= {^G M" : ^-xi < ^pilog2} n {x G M" : > 0, 1 < i < n}. 

i=l i=l 

If we consider variable weights as above then they must enter in the large deviation 
arguments. If ^(i, A^) ~ A/'(0, j^), then it is not hard to show that the distribution of 
cbiNiihN) satisfies LDP with rate function Accordingly, we get the limiting free 
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energy as 



where 



(2.6.11) 



T+ = {J G : < Xl^* 2}n{x eW : Xi>0,l<i< n}. 

1=1 * 1=1 

Though we seem to have two different optimization problems in (12.6. lOp and fl2.6.1ip . 
they produce the same result as the former can be transform to the later using affine 
transforms Ui = So this will lead to the expected result that the limiting free 
energy of the Gaussian GREM is continuous with respect to its weights. Not only in 
the Gaussian case this can be made precise in all the models (7 > 1) discussed in this 
subsection by the same way and for other models with some extra efforts. 

In the Gaussian case, that is when 7 = 2, two simple cases are worth mentioning. 
The numbers (3j mentioned below are same as the above, in these particular cases. 

Corollary 2.6.2. (Gaussian Gase) 



z) Lei < 4 < 2| < ■ ■ ■ < £f . Put (3, = y^^El^ j^r j = 1, ■ ■ ■ , n. Then a.s 



lim^logZ^(/3) =log2 + f E«? ifP<Pi 



N 



1 

n j 2 

E P^log 2 + E I3a^ V2p,,log2 + ^ E 
i+i 1 j+i 



if Pi </3< A+i fori <J<n, 



/3Ea^V2pilog2 iff3>f3n. 



1 



U) Let^ = ^ = ■■■ = ^->0. Then a.s. 



lim^logZ^(/3) = log2 + fE«? ^//5<yffl 



Remark 2.6.1. We say that an n level GREM with some particular driving distribution 
is in reduced form, if the limiting free energy of the model can not be obtained from 
any k level GREM with same driving distribution where k < n. 

For a Gaussian n-level GREM, as the above analysis shows, if it can not be ob- 
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tained as a fc-level GREM then /3jS are defined for 1 < i < n. On tlie otlier fiand if it 
can be obtained as tlie energy function of a fc-level Gaussian GREM for some k < n, 
then the /3jS of the construction are only for 1 < i < k. 

If a GREM is in reduced form, according to this definition, we do not know whether 
its energy function can be obtain as that of a A;-level GREM for some k < n with, 
of course, different driving distributions. Along with the setup of the model in this 
subsection, we are lucky enough to get the explicit expression of the limiting free 
energy. Moreover we know the explicit expression of the /3jS where the expression of 
the free energy are changing. We observed in this case that there may be at most n 
many /?jS. But we do not know, whether this the intrinsic property of the model or 
there are some driving distributions so that for an n-level GREM, we can get more 
than n many /3jS. 

Remark 2.6.2. It can be shown that the energy function determines the parameters of 
the model for every 7 > 1 and one could characterize functions those arise as energy 
functions for GREM. As observed in in the above Corollary, an n level GREM may 
reduce to a A; level GREM for some k < n or even to a REM. In such a case, some 
weights ttj occur in groups and get added up. Of course, in such a case when the model 
is not in reduced form, clearly it is not possible to recover the weights from the formula 
for energy. But it is interesting to note that when the GREM is in reduced form, we 
can recover the parameters from the energy function. To make the statement precise 

first of all note that, in this set up, GREM is in reduced from if and only if all the 

7-1 7-1 7-1 

77 7 

Pi, a,- are non zero and — — < — — < ■ ■ ■ < — — , let us assume this to be the case. 

7-1 

This is similar to that of the Gaussian case. Note that, in this case 3i = ('^^'^"g^^ ^ 
for 1 < i < n. From Theorem 12.6. it follows that the limiting free energy S{f3) is a 
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continuous function with £^(0) = log 2. It has a continuous derivative with 







if /? = 



Further, 



E +E/3^'«^'" iiPk<f3<f3k+i,0<k<n-l 

i=k+l i=l 



1_ 7 
7-1 ^7-1 
i 



if 



£"{(5) 



^/5^E«7"S forO</3</3i 



2-0- 



ri 7 



pT/9^E«r\ for/3fc</3</3fe+i, for l<A;<n 
fc+i 



0, 



for (3 > pn- 



The energy function can be characterized in this case. To start with, observe that 
the above energy function has the following properties: 

i) ^(0) = log 2 and ^(0) = 0, 

ii) £ is a continuously differentiable function, 

iii) denote Xk = ^^^''^"f^'^ ; = -L-^pT^Y^^r and ^ = ^ then = xo < 



and 9 > -1 with 



Xi < ■ ■ ■ < Xn < Xn+1 = OO; Ci > C2 > • • • > C„ > C„+i 

£"((3) = (1 + 9)0^(3" in (x^.i, x^) for 1 < i < n + 1. 
Conversely, let / be a function on [0, oo) such that 

i) /(0)=log2and /'(O) = 0, 

ii) / has continuous first derivative, 

iii) there are finitely many points < Xi < ■ ■ ■ < Xn and Ci > • • • > c„ > c„+i — 
so that the left and right derivatives of /' are unequal at for 1 < i < n and 
f"{x) = x^Ci in {xi-i, Xi) ior 1 < i < n + 1 with xq — and Xn+i = oo. Then / is 
the energy function for 7-GREM with driving distribution having parameter 7 = 
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P 



« (6»+2)log2 



^Tzzn{xi{c^ - Ci+i))^+2 and = (c, - c,+i)9+2 for 1 < i < n if 



E 



1=1 



log 2. 



(2.6.12) 



In particular for a Gaussian GREM, it is in reduced from if and only if all the pi, 
are non zero and ^i- < p- < ■ ■ ■ < f|. When that is the case, from Theorem 12.6.11 with 
7 = 2, it follows that the limiting free energy £{I3) is piecewise quadratic continuous 
function with £{^) = log 2. It has a continuous derivative S'{j3) with S'{0) = and 



1 



0, 



V2pi log 2 



ai 



f^al in ( v^liiMl, for 1< A; < n - 1, 

* ' \ (7,1. ' ai. I 1 — — ' 

ii(3> 



V2pn log 2 



Moreover, if / is a function on [0, oo) with /(O) = log 2 and /'(O) = so that 
there are finitely many points < a;i < ■ • ■ < x„, where the left and right derivatives 
of /' are unequal and /" is a positive constant, say, Cj in {xi-\,Xi) with xq = and 
= oo. Then / is the energy function for some Gaussian GREM iff 



Cl 



> ■ ■ ■ > c„ > c„+i = and ^ x- - q+i) = 2 log 2. 



(2.6.13) 



i=l 



2.6.2 7=1 

Now, let us assume 7 = 1. Note that 7 = 1 represents the two sided exponential 
distribution with mean and parameter 1. In this case, we can not use the above 
argument directly as the ratios B{j,k) defined in fl2.6.4p . the constants ai appear 
with exponent However, to get the expression for the free energy, we can directly 
proceed to evaluate 

n 

inf y^(l - Pai)xi, 



i=l 
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where 



= < X e M" : ^ \xi\ < ^Pilog2, 1 < k < n \ n {x : Xi > ioT 1 < i < n}. 

I 1=1 i=l J 

This is what we will do now. To calculate the infimum, let us set tq — and for 
A; = 1, 2, • • • , let us define j3k, as follows: 

(5\ — min{ — : 1 < i < n} 

ri = max{i : — = 



and in general, for /c > 1, 

= min{— : Vk-i <i<n} 
Tfe = max{i : Vk-i <i <n, — = 

Obviously this process stops at a finite stage say at K, so that Pk = ^ and tk = n. 
We put — oo. For example, if ai > 02 > • • • > o-n then = — , = /c for 

k = 1,2, ■ ■ ■ ,n, and K — n. On the other hand if ai < 02 < • • • < Qn then Bi — — , 

tin 

ri = n and K = 1. 

Clearly, = /5o < /3i < • ■ ■ < /^i^ < /^x+i = 00. 

Remark 2.6.3. The case 7 = 1 can also be recovered as a limiting case from the 

previous section. We can proceed by defining as done in the last subsection. But 

7-1 

now we have to take hmit hm-^j^i | (E2+z±£fc)li!^ j define B{j,k). A simple 

calculation shows that, Bij,k) — ^ . Hence the dhS defined in the earlier 

subsection lead to the same formula as above when /9 | 1. 

Now, fix j < i^T and let /3 e [Pj, Pj+i). Define x e as follows: 

Xi — Y] Pi log 2 if i = r; for some 1,1 < I < j 

3=rt-.)+i ' - (2.6.14) 

= otherwise. 
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n 

Claim: inf 7^(1 — I3ai)xi occurs at x. 

i=l 

In case j = that is /3j — I3q — the claim is obvious. Indeed, for P < Pi, 
(1 — Pui) is positive for all i, the infimum occurs at x with Xi = for all i. So let 

us assume that j > 1. First note that, since P > Pj, we have P > Pi for 1 < I < j 
and (1 — ^) = (1 — pdri) < . Moreover since Pi are strictly increasing with /, the 
numbers strictly decreasing, that is, a^i > a^j > • • • > a^^ . Now to prove the 

claim, fix any x e 



y^(l - Pai)xi - - /3ai)xj 

1=1 1=1 

> ^(1 - pai)xi - ^(1 - pai)xi 

i=l i=l 

(Since (1 — /Soj) > and = for i > r^) 

i / -n 

= ^ ^ (1 - /3aj)Xi - (1 - Pari)xri 
1=1 y=r(j_i)+l 

(By definition of Xj) 



1=1 \i=r(,_i)+l 

(Since by definition a^, > for r(i_i) + 1 < i < r^) 

= (1 - /3a^ .) I XI ~ 5Z J + 5Z /^("'•(i+i) - «n) ( XI ~ ^'•i 
\ 1 1 / i=i \ 1 1 

= (1 - Par^) ^{xi-pi\og2) + X/3(ar((+i) - flrj XI " Pi ^og 2) 
1 1=1 1 

>0 



The last inequahty follows from the facts that (i) by definition ~Pi ^og 2) < 0, 

1 

(ii) (ar((+i) ~ Ojri) < for 1 < / < j and (iii) (1 — Pa^^) < 0. Hence, the proof of the 
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claim. 

Here then is the formula for the free energy. 
Theorem 2.6.3. For two sided exponential GREM, almost surely, 



1 log2 ^//5</5l 

lim — logZ^(/?) = <^ 

log2 + E(/5an-l) E vdog2 z/ /3, < /3 < . 
I (=1 n-i+i 

Remark 2.6.4. Once again, the free energy for the case 7 = 1 can be recovered from 
that of 7 > 1 as a limiting case. It is quite easy to check that with notation of f3i 
and r; as in subsection 2.6.1, lim^j^i(/5iaj) t-i = ^ ^ pjlog2 where k = : 
Qi = maxrj;_^j+i<i<r, ctj & + I < i < ri}. Moreover, for Pai < 1, we have 
lim^ji(/?aj)~ = 0. So = lim^jiXi(7), where Xi as given by fl2.6.6p . may not give 
Xi as defined in (12.6.141) . The only difference will be that Xj = for + 1 < i < ri 
and Xri = Pj ^ whereas, with the same notation of k as above, yi = 

i=r(;_i) + l 

i X] Pi log 2 for those i where r(i„i) + 1 < i < ri and aj = maxj,^j_^j_,_i<j<r; Oj. 

n 

But it is easy to see that if inf > (1 — Pai)xi occurs at x, then it will occur also at 

i=l 

y = {yi). Thus the limiting free energy in the case of 7 = 1 is nothing but the limiting 

(7^1) case of the limiting free energy of the GREM where 7 > 1. 
Thus Remarks 12.6.31 and 12.6.41 will lead to the following 

Theorem 2.6.4. If £^{j3) and £{(3) denote the limiting free energy as given by The- 
orems l276J\ and \2.6.^ respectively, then for all P >0, almost surely, 



\im£,{P) = £{P). 

As in the case of 7 = 2, for the case 7 = 1 also two special situations are worth 
mentioning 
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Corollary 2.6.5. i) Let ai > a2 > ■ ■ ■ > an- Then a.s. 



lim i-logZ^(/5) = { 

N^oo iV 



log2 ^fP<^, 
log2 + ^(/5a.-l)p.log2 z/^</5<^ 



i=l 



< i=l 



ii) Let ai < a2 < ■ ■ ■ < an- Then a.s. 



(3an\og2 tf(3>^- 



lim -\ogZj,{P) 



Remark 2.6.5. Returning to Theorem I2.6.3[ it is interesting to note that exponential 
GREM with parameters [pi, ■ ■ ■ ,Pn, ai, ■ ■ ■ , a„) is equivalent to GREM with param- 
eters {p[, ■■■ , p'^, ■ ■ ■ , a'^) where p'^ = ^ p^, P2 = 11 Pj, ■ ■ ■ ,p'k = E Pi and 

1 ri+l T-(^_i)+l 

This is evident from Theorem 12.6.31 Here 'equiv- 
alent' is used in the sense that for every (3, both systems have the same free energy. 
Thus, in order that an n-level GREM does not collapse to a lower level GREM it is 
necessary and sufficient that the weights a, be strictly decreasing. One should keep 

in mind that we are using the same distribution at all levels of the GREM. 

The purpose of the following remark is to show that the energy function determines 
the parameters of the model. One could characterize functions that arise as energy 
functions for exponential GREM. 

Remark 2.6.6. As observed in the previous Remark, an n level GREM may reduce to 
a K level GREM for some K < n. In the exponential GREM, some weights do 
not appear in the formula for free energy. When such a thing happens it is clearly 
not possible to recover the weights from the formula for energy. It is interesting to 
note that when the GREM is in reduced form, we can recover the parameters from 
the energy function. Here is the precise statement. 

Since an exponential GREM is in reduced form if and only if ai > ■ ■ ■ > > 
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and Pi 7^ for 1 < i < ra, let us assume this to be the case. Let £{l3) be the 
energy function, that is S{j3) = Urn log Z]\r{P). From Theorem 12.6.31 it is easy to 
see that S{l3) is a piecewise hnear continuous function of j3 taking value log 2 near 

k 

zero. Further, its derivative S'{P) = ^ajPjlog2 in (^,^^)- These properties are 

1=1 

good enough to show the following: S{P) uniquely determines all the quantities pi 
and Oj. In other words, the energy function identifies the parameters. 

If < a;i < ■ • ■ < x„ be the points where the left and right derivatives of S{P) 
are unequal, then = ^. Further, if £'{(3) = ci in {xi^Xi+i) then pi = ^'^^l~^2~^'^ for 
1 <i <n. Here Xq = and = oo. 

In fact the above considerations lead to a characterization of energy functions for 
exponential GREM. Suppose / is a continuous function on [0, oo) with /(O) = log 2. 
Further suppose that there are finitely many points < xi < ■ ■ ■ < x„ where the 
left and right derivatives are unequal and /' is a constant, say, Cj in (xj,Xj+i). Here 
Xo = and x„+i = oo. Then / is the energy function for some exponential GREM iff 

n 

= Co < ci < ■ ■ ■ < Cn and ^^Xi(ci — Ci_i) = log 2. (2.6.15) 



i=l 



2.6.3 0<7<1 

Now we come to the case 7 < 1. Unlike in the above two subsections, here we 
have not been able to derive the closed form expression of the free energy for general 

n level trees. For 7 < 1, the function ^ (— — (3aiXi \ is not a convex function, 

1=1^'^ ' 

rather a concave function. Moreover the domain \E' = {x G M" : X]i=i I^^T — 
Si=i7Pilog2, 1 < A; < n} is also a non-convex set. Hence in order to calculate 

n / 7 \ 

inf y" ( — - ^aiX, ) (2.6.16) 



with 



^+ = {x G M" : ^|xi|^ < ^7Pilog2,&Xfc > for 1 < /c < n}, (2.6.17) 



i=\ i=l 
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we can not use the convex analysis, as we did for the case 7 > 1. However, by change 
of variables, the problem can be brought back to optimizing a convex function over a 
convex set. Now we specialize to the case n — 2. We shall calculate 

inf I (-xj - Paixi) + (-x2 - Pa2X2)\ (2.6.18) 

xe*+ 7 7 J 

with 

'^+^{{xi,X2) >0: x5'<7Pilog2,x7 + xi<7(pi+p2)log2}. (2.6.19) 

To do so, we transform the problem by denoting -xj — x, -xl — y, 0171' = a, 
a27T = h, pi log 2 — c, P2 log 2 — d and a = ^ so that, a > 1 and we need to calculate 

- sup {(/3ax" -x) + {(3by'' - y)} (2.6.20) 

with 

= {{x, y) > : X <c,x + y <c + d}. (2.6.21) 

Let f{x,y) = {j3ax°' — x) + {Phy"' — y). Since f{x,y) is a convex function and we 
are looking for supremum over a convex set, the supremum occurs at the boundary 
points. Note that ^+ is a polygon. Where as for any e e R, the set {f{x, y) — e} is 
either empty set or a smooth curve. Hence the above supremum occurs at one of the 
corner points, ^ = (0, 0), 5 = (c, 0), C = (c, d) and D = (0, c + d), of Now 

f{A)%f{B) iff/?|^, (2.6.22) 
/(A) I /(C) iff/3= ""^f, , (2.6.23) 

^(^)i^(^) ^^^i5(^TdFT' 

/(S)|/(C) iff/?i^, (2.6.25) 
m I /(Z^) iff b{c + dr > ad^ and /? | (2.6.26) 

f{B) > f(D) if 6(c + d)"" < ac"", (2.6.27) 
/(C) I /(D) iff ac" + I 6(c + d)". (2.6.28) 



Note that the last two relations do not depend on (3. Now comparing all the possibil- 
ities, we obtain the following three scenarios: 
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Let us assume b{c + d)" < ac" + Then it is easy to see that 

b{c + dr < ac- +bd-^[ ^fir ^ ^ 



Now 



and 



imphes 



c+d 1 1 1 

< 7 ^ 7 < 



ac" + ^d'^ bd'^-^ ac"^-^ bd""-^ ' 



1 < c+d 1 < 1 



ac"-i > ac" + ac"-^ > bd'^-^ 



1 c+d 1 1 

< < ^ 7 < 



ac"-i ac" + 6ci" ~ b{c + 6ci"-i ' 

Hence we get 

(f{A) ifO<P<^ 
supf{x,y) = lf{B) if-I^</3<^ (2.6.29) 

Let us assume 6(c + d)'* > ac" + 6o?" & b{c + d)"~^ < ad^'^. Then it is easy to see 
that 

r/(D)>/(c), 

b{c + ci)" > ac" + 6d« ^ i ^(^^ < 



d ^ 1 



< 



Moreover, 

6(c + dT'^ < ac^-^ ^ ^-r < ^— ^ < 4- 

^ ' ~ ac"-i ~ 6(c + (i)"-i ~ b{c + dY-ac'' 

Thus 

11 1 

< ^ 7 < ^ < 



ac°-i ~ b{c + ~ b{c + dy - ac" bd""-^ 
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Hence we get 



'f{A) ifO</3<^ 
snpfix,y) = U{B) if ^ < f3 < 



(2.6.30) 



Let us assume 6(c + d)^ > ac°' + bd°' & 6(c + ^ > ac" ^. Then it is easy to see 



that 



Moreover, 



r/(D)>/(C), 



c+d 



6(c + d)"~^ > ac"-^ 



1 1 

> -n ^ 7 > 



d 



ac^"'^ h{c + dY-^ b{c + d)°' - ac" 



Thus 



d 



c + d 

b{c + dy - ac°' ^ b{c + d)"'^ ^ ac'^ + hd 



< 



< 



Hence we get 



f{A) ifO</5< 1 



snpf{x,y) = {\)-Z VnZ" 



We can conclude the above three cases in the following: 



(2.6.31) 



Theorem 2.6.6. For two level GREM with driving distribution having density (pN^-y 
as defined in Ii2. 6. 1\) with < 7 < 1, we have, almost surely, 



1. if a2 < ai(pilog2)T + 02(^2 log 2)^, then the limiting free energy is 
log 2 

P2 log 2 + /5ai (7pi log 2) ~i 



for0</3<^^ 

ai7T(pilog2) 1 



for -^-^ <f3< -^-^ JET, 

ai7T(pilog2) 1 a27T(p2log2) i 

/3(ai(7pilog2)^ + a2(7P2log2)^) for (3 > — j — ^ tet- 

0127 ^(P2 log 2) T 

i i 1 1 

2. if a2 > ai(pilog2)T + 02(^2 log 2) ^ and < ai(j9ilog2) t , then the limiting 



free energy is 
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log2 forO<f3< 
P2 log 2 + Pai (7pi log 2) ^ for 
/?a2(7log2)^) forP> 



I JET > 

ai7T(pilog2) T 



i 1-7 

ai7T(pilog2) T 



P2 log 2 

T T 

02 (7 log 2) T -ai (7pi log 2) T 



P2 log 2 



I T 

a2 (7 log 2) T -ai (7pi log 2) ^ 



1-7 



5". z/ 02 > ai(pilog2)T -|- 02(^2 log 2) T and a2 > ai(pilog2) t , then the limiting 
free energy is 

log2 for0</3< 
/3a2(7log2)^) for 13 > 



~T 1-7 ' 

027^ (log 2) T 



1 1 — f 

a27T(log2) T 



Remark 2.6.7. Note that in a 2 level double exponential GREM (in the earlier subsec- 
tion) with weights ai and 02, we had at most two cases, namely, ^ < ^ and ^ > ^• 
Where as for 7 < 1, we have three cases. 

Though there are three cases, we can think of them as two cases like the double 



exponential GREM, namely. 



1-7 

ai{piIog2) 7 



< 



1-7 
a2(log2) 7 



and 



1 



> 



ai(pilog2) 7 



1-7 i 
a2{log2) 7 



where the first case has two more subcases, namely, 



and 



1 ^ 

a2(p2log2) 7 



P2 log 2 

T r 

a2{log2)7-ai(pi log 2)7 



) 1 — I 

£12 (P2 log 2) 7 



< 



P2 log 2 



1 1 
02 (log 2)7 -ai(pi log 2) 7 



2.7 Level-dependant Distributions 

We already mentioned that the LDP holds good even when the driving distributions 
at various levels are different. To be precise, fix numbers 7i,--- ,7n; each greater 
than 0. Consider an n level GREM with the driving distribution at the i-th level 
being 0Ar,^. given by (12.6.11) . That is, at the first level for each edge ai the associated 
random variable ^(cri) has density 4>N,'ri- general, for any edge cti ■ ■ - cTj at the i-th 
level, the associated random variables ^(ui ■ ■ - ai) has density 0Ar,^.. Then the map 
Sjv ^ M" by 

f ^(q"i,u;) ^(eriq-2,^) ^(ai ■ ■ ■cr„,a;) ^ 

induce random probability ^n{'^) on by transporting the uniform probability on 
Stv- Theorem 12.4.61 suggest that in this case the free energy of the system will be 




(2.7.1) 
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where is the intersection of 

^ = {x G M" : < ^Pilog2, l<A;<n}, 

i=i i=i 

with the positive orthant of and = hm 

In its generahty, it is very difficuh to have a closed form expression for the above 
infimum. May be there is no general closed form expression, for the infimum and 
hence for the free energy of the system. To make a beginning and to see what one can 
expect, we now specialize to the case n = 2. The limiting frequencies lim^^^^ are 

Pi for i = 1,2. The weights for the two level are Oi and 02 respectively. We assume 
Pi, P2, 0.1,0.2 are strictly positive. 



2.7.1 Exponential - Gaussian GREM 

In this case we consider the distributions at the first level to be 0Ar,i and at the second 
level to be (1)^,2 — that is, exponential and Gaussian respectively. So from ( I2.7.ip . 
the expression for the free energy for this case will read as follows: 

= log 2 - inf {/(x, y) : X, y > 0; X < pi log 2; x + ^y^ < log 2} (2.7.2) 

where 

fix, y) = x(l - Pa^) + - ^a^y. (2.7.3) 

To calculate £{[3) explicitly we proceed as follows. First we discuss the case [3 < 
Then we discuss (3 > —. This later case leads to three subcases. In each subcase com- 

^ ai 

bining the conclusion along with the case /3 < — , we give a full picture of £{P) for all 
values of (3. 



I. f3< ^ 



On the interval [0, 00), the function |?/^ — Pa2y decreases up to /5a2 and then in- 
creases. So when /3 < ^, that is when 1 — (3ai > 0, the above function attains its 
minimum at the point, (0, A a/2 log 2). 
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II. /3> ^ 

If /5 > ^, here is how to calculate the infimum. The function giy) = inf f{x, y) is 
2;iven by 



9{y) 



Pi(l-/3ai)log2 + iy^-/?a2y forO<2/< V2p2log2 

(1 - (3a{) log 2 + \(3aiy^ - (3a2y for ^2p2 log 2 <y< ^m^. 



This is because, when < ?/ < \/2p2 log 2, inf/(a;,?/) is attained at x = pilog2, 

whereas in the other case the infimum is attained at x = log 2 — |?/^. 

Since the required infimum of / is just the infimum of g{y)i one has to calculate 
inf giy) by analyzing g in the two intervals separately. This is what we do 

0<y<V21og2 

below. First note that the function g is continuous. Now we have the following three 
scenarios. 



Al: < V2P2 log 2 



Let us assume ^ < y/2p2 log 2. First let us consider (3 such that — < /3 < 



V2p2 log 2 



In particular, /3a2 < V'^P2 log 2 where as ^ < a/2p2 log 2. So the function |?/^ — /3a22/ 
is decreasing up to /5a2 in [0, V2p2 log 2] and then increasing. Thus in [0, ■\/2p2 log 2], 
g attains its minimum at /5a2. On [^/2p^\og2, A/2Tog2] the function ^^aiy"^ — j3a2y = 
/3ai(|y^ — ^y) is increasing. Hence, g being continuous, for the values of /? under 
consideration, the infimum wi ll occu r at y = (3a2- 

Now (5 be such that p > ^^^l^Sl ^^^^ ^ ^2^2 log 2. Since < V2p2 log 2 
the function |/3ai|/^ — /?a2y is increasing on [\/2p2 log 2,-^/2 log 2] . The function 
^y'^ — Pa2y is decreasing on [0, ^2p2 log 2] attaining infimum at ?/ = -\/2p2 log 2. As a 



2 

consequence, for (3 > ^'^^1^°^ ^ , the infimum of is occurs at ^2^2 log 2. 

Thus combining I. and above para we conclude that if — < y/2p2 log 2 then phase 

transitions take place aX (3 = and /3 = go^ substituting this corresponding 

arguments where minimum is attained in fl2.7.3l) . we have the following 

Theorem 2.7.1. In the Exponential- Gaussian GREM, if ^ < \/2p2 log 2 then almost 
surely, 



hm ^logZ^(/3) 

N—^oo iV 



log2 + i/?2a2 ^fP<i 
P2 log 2 + ^P^aj + Ppiai log 2 ^ < < 



V2p2 log 2 

0.2 



V2p2 log 2 



/3(a2V2p2 log2 + aipi log 2) > 
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We can picture the value of S{l3) against (3 as given below. The values of (3 are 
given under the line and values of S{l3) are given above the line. The phase transitions 
occur at the dark lines. 

Subcase AI 

^p^al + \og2 ^P^al + {ppiai+p2)log2 /3(a2V2p2log2 + aipi log 2) 

\ O 



J_ V2p2log2 V2 log 2 

j3 — > 11 a2 a2 

This case seems rather peculiar. This is indeed a sum of two REMs, as fol- 
lows. Imagine placing exponential random variables ^o-i at the first level and one 
i.i.d bunch {^0-10-2} is placed below each first level node. In other words, consider 
{f](T2 '■ ^1 ^ 2^*^^'^)} i.i.d A/'(0, A^) and set ^0-^0-2 = r/o-j for all ox.ai- Consider the 
corresponding Hamiltonian Hj^io) = ai^^o-^ + 0-2^0-1 0-2 = cti^o-i + 02^0-2 • Let us set 

= ^e^"^^"^!, the partition function for the A^)-particles system consisting of 

exponential Hamiltonian with weight ai. Let Zj^ = ^e"^'''^2, the partition function 

0-2 

for k{2, N) particle system consisting of Gaussian, Af{0, N) Hamiltonian with weight 
02. Clearly, = Z\j ■ Zfj. If, for i = 1,2; £i = \mi-^\ogZ]^ then the exponential 

REM formula [28l[29| yields, a.s., 

(^/3piailog2 if/3>^. 

The Gaussian REM formula (keeping in mind that for fixed, the fc(2, A^) particle 
system has A/'(0, A^) Hamiltonians as opposed to A/'(0, fc(2, A^)) yields, a.s. 



V2l0g2+ ia2/32 if /3 < V2p2log2 

(3a2^2p2 log 2 if/3> 



^m = {T^^:^' /."vfe (2.7.5) 



a2 



One can now verify that, a.s. 

In other words the GREM behaves like sum of two independent REMs, one exponen- 
tial and other Gaussian. The word independent is used here in the sense that there 
is no interaction between these two REMs - that is, there is no interaction between 
the k{l,N) particles and the k{2,N) particles, as if there is a barrier between these 
two sets of particles. Of course, this is so as long as — < y/2p2 log 2. 
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Let us assume y/2p2 log 2 < ^ < y2Tog2. Then ,5 > ^ means /3a2 > > 
■\/2p2 log 2 where as ^ < y/2 log 2. So the function ^y'^ — (3a2y is decreasing on 
[0, \/2p2 log 2J and the other function |/3ai?/^ — /3a2|/ = /3ai(|?/^ — ^?/) is decreasing 
up to ^ in [v^2p2 log 2, < a/2 log 2] and then increasing. Hence, as is continuous, 
the infimum will occur at ?/ = ^. Thus, the phase transition takes place at /5 = ^. 
So we have the following 

Theorem 2.7.2. In the Exponential-Gaussian GREM, if V2p2 log 2 < 22 < ^2 log 2 

i/ien almost surely, 



lim — logZjv(/3) 

N^oo iV 



log2 + i/3V 



/3(|J + ailog2) z//3>;^ 



As earlier, we can picture the value of S{l3) against (3 as given below. The values 
of (3 are given under the line and values of S{(3) are given above the line. The phase 
transitions occur at the dark lines. 



Subcase A2 



V2p2 log 2 
12 



/?(iJ + ailog2) 



ai 



V2 log 2 
12 



In this case, we observe that the free energy for inverse temperature up to ^ is 
given by log 2 + ^P'^a^. This can be thought of as the Gaussian REM energy but not 

going all the way up to /? < ^ but cut short at ^. This can also be thought 
of as the sum of the two energies Si and £2 as in fl2.7.4p and fl2.7.5l) . but then the 
Gaussian effect is prolonged up to /3 < ^ instead of stopping at v^l^i°il. w^e do 
not know which is the correct interpretation. For P > the system exhibits a new 
phenomenon which we are unable to explain. The term /3ai log 2 is reminiscent of the 

1 

exponential REM energy. The other term 2/^" appears to be new. 



A3: V2T^<gj^ 



Let us assume ^2 log 2 < ^. Then f3 > ^ means f3a2 > ^ < ^2 log 2. So 



both the functions \y'^ — Pa2y and ^Paiy"^ 



0.1 

(3a2y 



[3ai 



^y) are decreasing on 
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[0, A/2p2Tog^] and [\/2p2 log 2, < y/2 log 2] respectively. Hence the infimum will occur 
at y = \/2 log 2. Being /5a2 > \/1 log 2, the phase transition takes place at /5 = 
Hence we have the following 



Theorem 2.7.3. /n the Exponential- Gaussian GREM, if ^ > \J1 log 2 i/ien almost 
surely, 



hm -logZ^(/?) 

N—too I\ 



log2 + i/?2^i ^//5<^ 



V2 log 2 
"2 



As earlier, we can picture the value of against P as given below. The values 
of P are given under the line and values of S{j3) are given above the line. The phase 
transitions occur at the dark lines. 

Subcase A3 



i/32ai + log2 







/3 



V21og2 

0.2 



oi 



Thus in subcase A3, the system behaves like a REM with Gaussian distribu- 
tions [16] having weight 02, that is, as if H]^{a) are i.i.d centered Gaussian with 
variance a2N. For example, when ai = 02 then this is just the standard Gaussian 
REM. It does not depend on the quantities pi and p2- Even when p2 = 0.0001 (very 
small) the first level exponentials do not show up in the limit. Further the GREM 
reduces to a REM. Of course, this is so as long as A/2Tog2 < — . This should be con- 
trasted with subcase Al where the entire system behaves like sum of two independent 
REM, one Gaussian and other exponential. 



2.7.2 Gaussian - Exponential GREM 

Let us consider the situation where the driving distributions at the first level are 
Gaussian, 07v,2 and at the second level they are exponential, (f)N,i- Moreover, as 
earlier ai and 02 are the weights at the first and second level respectively. We will use 
the same notation for k[l, N), k{2, N) and for pi,P2- In this case the general formula 
of Theorem 12.4.61 reduces to the following: 

lim ^ log Zn{(3) = log 2 - inf {/(x, y):x,y>0; x < ^2pi log 2; ^x^ + y < log 2} 
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almost surely, where 

y) = - Pdix + y(l - /3a2). 

In this case to calculate the infimum we proceed as follows. Put g{x) = inf f{x, y). 

y 

Since (l/9a2) > for /3 < ^, we have 

1 1 
g{x) — -x^ — (5aiX if /3 < — 

2 02 

= ]-x^ - (3aix + (1 - (3a2) (log 2 - ^x^) if /3 > — 
2 2 a2 

that is 

g{x) — -x^ — (3a\x if /3 < — 
2 a2 

= (5a2 {]:x^ - -x) + (1 - (5a2) log 2 if /? > — . 

V2 a2 J 02 

Since infimum of /(x, y) is same as that of infimum of g over x, one has to calculate 
inf g{x)- Here, we will have the following two scenarios. 

0<x<V2pi log 2 



Bl: < V2pilog2| 

Let us assume ^ < \/2pi log 2. If /3 < ^, then j3ai < y/2pi log 2. The function 
— /3aix decreases up to /3ai an then increases. Hence when (5 < the infimum 
occurs &X, X — (5ai. For /3 > ^ as ^ < v^2pi log 2, the infimum will occur aX x — ^. 
So we have the following 

Theorem 2.7.4. In the Gaussian-Exponential GREM, if ^ < V^Pi log 2 then almost 
surely, 

1 , „ ... |log2 + i/5V ^/ /5<i 

/?(i| + a2log2) z//5>^ 



hm -\ogZM 



As earlier, we can picture the value of against (5 as given below. The values 
of P are given under the fine and values of S{P) are given above the fine. The phase 
transitions occur at the dark lines. 
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Subcase Bl 



/3 



02 



V2pilog2 



B2: V2pilog2 < ^ 



Let us assume ^ > VSpi log 2. If /3 < ^ we have /3ai < ^. So the quantity /3ai 
will be in [0, y/2pi log 2J as long as /3 < ^/^32ll_ As the function — (5aiX decreases 
up to j3ai an then increases, for /3 < ^'^p'^°^'^ the infimum occurs at x = (5ai. But for 
and for V^Pi^°g^ < < J_ the infimum occurs at x = \/2pi log 2. For 3 > — consider 

ai ' — 02 ' V r- J. o '02 

the function ^x'^ — ^x which decreases up to ^ and then increases. As ^ > \/2pi log 2, 

2 02 f_ 02 02 ^ ° ' 

the infimum will occur at x = V^Pi log 2. Thus we have the following 



Theorem 2.7.5. In the Gaussian- Exponential GREM, if ^ > V^Pi log 2 then almost 



surely, 



hm -logZ^{(3) 

iV-+oo iV 



log2 + i/?2a2 

P2 log 2 + /3ai V2pi log 2 



V2pilog2 



02 



if 



V2pilog2 



^ /3 (ai \/2pi log 2 + a2P2 log 2) if (5 



0.2 



> 



02 



As earher, we can picture the value of S{P) against P as given below. The values 
of /3 are given under the line and values of are given above the line. The phase 
transitions occur at the dark lines. 



Subcase B2 



l/3^al + \og2 







(3ai^/2pi log 2 +P2 log 2 /?(ai V2priog2 + 02^2 log 2) 



/3 



V2pilog2 

Ol 



02 



V21og2 



Remarks similar to Exponential-Gaussian GREM apply here as well. Subcase Bl 
is similar to subcase A2. Here also the term ^P-^ is not reminiscent of anything we 
know. 

Subcase B2 is similar to that of subcase Al. That is in subcase B2, the limiting 
free energy is sum of two REM free energies - one is of Gaussian REM and other is 
of exponential REM. To be precise, the Gaussian REM limiting free energy (keeping 
in mind that for N fixed, the k{l, N) particle system has A/"(0, a'^N) Hamiltonian as 
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opposed to A/'(0, a^A;(l, N)) yields, a.s, 



(2.7.6) 



/3aiV2pilog2 if^> V2aJ2£2 



On the other hand, for fixed A?^, if we have configurations 2'^^^'^^ then the exponential 



REM limiting free energy, with Hamiltonian as a2 times double exponential random 
variable yields, a.s.. 



The reader should note that to compare subcase B2 with subcase Al, we interchange 
02 with ai and p2 with pi (to maintain the same weights and proportions for the 
exponential and Gaussian levels). 

The last interesting note is that in Gaussian-Exponential GREM, the system never 
reduces completely to a Gaussian REM as happened in subcase A3. 

Thus the large deviation technique allows the use of different distributions at differ- 
ent levels leading to some interesting phenomenons. The conclusions of Exponential- 
Gaussian GREM differ from those of Exponential-Gaussian. The system may reduce 
to a Gaussian REM even with a very small weight is associated to that level. Even 
the system may appear as a system of two independent REMs separated by a big wall 
preventing them to interact between each other. Moreover, there are situations where 
we could not explain the terms present in the expression for energy. 




(2.7.7) 



Now it is easy verify that, in subcase B2, a.s. 



Chapter 3 

More Tree Structures including 
Randomness 



In this chapter, we will consider several models similar to that of Generalized Random 
Energy Model. In the previous chapter, we formulated GREM in general tree set up 
with out giving any examples of general tree structures. The set up also allows us 
to randomize the tree structure. First we consider regular trees but the trees are 
random, driven by Poisson random variables. Then we consider non-regular random 
trees again driven by Poisson random variables. We prove that in both the cases 
the free energy exists for almost every tree sequences and they are same as that of 
usual deterministic tree GREMs for almost every sample point. Also we consider 
Multinomial trees. These will be explained later. 

The usual GREM has hierarchical structure, and it is so in all the above mentioned 
models. In 2006, Bolthausen and Kistler [3] defined a model which is a generalization 
of the GREM where the model is no longer hierarchical. They called the model as 
non-hierarchical version of GREM and prove the existence of the free energy by using 
second moment method. Surprisingly, the energy expression is again the same as 
that of the usual GREM. So the non-hierarchy does not play a role in the limiting 
free energy. We produce an alternative proof of their result through large deviation 
techniques and show that the free energy of this model is minimum of certain hidden 
GREMs. Then we introduce another model, block tree GREM where the energy 
is maximum over certain GREM energies. We present further generalization in a 
model, in the next chapter, through which we can get all the models REM, GREM, 
Bolthausen-Kistler model and their versions with the external field. 
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3.1 Regular Poisson GREM 

In generalized random energy model, we have randomness coming from the driving 
distributions. The reformulation of GREM in general tree structure allows us to 
introduce another randomness at the tree level which is independent of the randomness 
of the Hamiltonians. As usual for particles system, let {k{i,N), 1 < i < n} 
be a partition of into n (the level of the tree) positive integers. Consider, for 
each A^, independent random variables Li]y, ■ ■ ■ , Lnjy where Lj^r ~ p(^2^i'^'^^), i.e. a 
Poisson random variable with parameter 2*^^*'^^ for 1 < i < n. Let us construct 
a random tree with (1 + Lj^r) nodes at the i-th level below each node of the (i- 
l)-th level. That is, at the first level there will be 1 + Li^ many edges and at 
the second level there will be total (1 + LiAr)(l + L2n) many edges. Here we are 
considering 1 + Lusf instead of Li^ itself, to take care of the situation Li^ = so 
that each branch in the tree is of length n. Once again we denote the edges at the 
first level by ai and the second level edges below ai as aio'2 and so on. The weight 
of the i-th level is > 0. Similarly we will associate independent random variable 
,^(o"i ■ ■ ■ o"j) with the edge cti ■ ■ ■ cTj. In this case for A^ particle system, instead of 2^ 
configurations we will have (1 + Li7v)(l + L2n) ■ ■ ■ (1 + Lun) many configurations. Of 
course this is also a regular tree, but random, and could be called regular Poisson 
tree. The corresponding GREM model, where the Hamiltonian for the configuration 

n 

a = (cTi, ■ • • , cr„) is defined as Yl (^i^i'^i ■ " ■ ^i)) can be called a regular Poisson tree 

1=1 

GREM with parameter k = {k{l, N), ■ ■ ■ ,k{n, N)). The next result says that if the 
same conditions as in Corollary 12.3.21 hold then even with randomization of tree, the 
conclusion holds for almost every tree sequence. 

Proposition 3.1.1. Consider a regular Poisson tree CREM with parameter k. The 
following is true: 

^) U 2''^^'^^'^ ^'^'^*'^)giiv ■ ■ ■ QiN < oo, for some i,l < i < n then for a.e. tree 

N>1 

sequence, a.s. eventually, /iAr(A) = 0. 

b) If Yl 2-('=(i'^)+-+'=(^'^))gfj^ ■ ■ ■ g^:^^ < oo, for each i = I,-- - then for a.e. 

N>1 

tree sequence the following is true: for any e > 0, a.s. eventually, 
(1 - e)qiN ■■■qnN < /^Af (A) < (1 + e)qiN ■ ■ ■ qnN- 

Proof, a) It suffices to verify the hypothesis of Theorem 12.3.1( a) holds for almost 
every tree sequence, that is, Y BiNqiN ■ ■ ■ qiN < for some i. Recall that Bi^ is 

N>n 

the number of branches at the i-th level. 
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But we could prove a stronger statement, namely, if for some i with 1 < i < 
j2 2'^(i'^)+-+'=(*'^)gi;v ■ ■ ■ giTV < oo, then E Bij^Qm ■ ■ ■ QiN < oo for that i 

N>n N>n 

where is the tree expectation. Since the tree randomness is independent of the 
Hamiltonian randomness, in view of the hypothesis, it suffices to show 

ErBiN < 2fc(i.^)+-+'=(*'^)+\ (3.1.1) 



Using independence of the random variables {Ljjy, 1 < j < i), we get 



EtB,n = eJIII + Ljn) = 11^(1 + Ljn) = 11(1 + 2'^^'"^^) < 2^ J]2'=(^'^). (3.1.2) 

j=i j=i j=i j=i 

b) It is enough to show that for fixed e > 0, almost every tree sequence satisfies 
the stated conclusion. This is achieved by verifying that the hypothesis of Theorem 
12.3.1( b) holds for almost every tree sequence, that is, Yl — < oo. Recall 

N>n * 

that, sfj^ = J2 ■ ■ ■ y'^i) where e((Ti(T2 ■ ■ - cTj) denotes the number of nodes at 

the n-th level below the node o"i(T2 ■ ■ ■cxj and Bn is the number of leaves or the the 
total number of branches in the tree. 

Here also we prove a stronger statement, namely, E^ 2J — ~. — ~ < ^ each 

N>n ^ * 

i where E^ is the tree expectation. Again, since the tree randomness is independent 
of the Hamiltonian randomness, in view of the hypothesis, it suffices to show 

\BnJ 

i n 

But due to regularity of the tree sf^ = 11(1 + Ljn) Yl (1 + ^jN^ and Bj^ = 

j=l j=i+l 



11(1 + Lj^y. Hence 
i=i 



* 1 



Bn 1 + 
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Thus using the independence of the random variables {Ljn, 1 < j < n), we get 

Since for a Poisson random variables X with parameter A, Ejip^ ~ J ^ ^~^) 
and since Ljn ~ P(2'^(*'^)) we have, 



E 



1 + 



N 



2-Hm (^^ _ g-aM^-iv) j ^ 2-^(^'^). (3.1.5) 



Substituting flXT3]) in flXTD we get flXT^ . □ 

Now further if we assume that g) for 1 < i < n and the random 

variables ^{ai---ai) are distributed like 0Ar,^- as defined in fl2.4.ip . the rest of the 
proof for existence of the free energy is the same as that of Theorem 12.4.61 Thus if 
the sequence | | satisfies LDP with good rate function Xj for each i with scale 

parameter A^, then we have the following. 

Theorem 3.1.2. Assume the setup as in the above paragraph. For regular Poisson 
tree GREM, for almost every tree sequences, almost surely, 



1 f 

hm - logZ^m = log 2 - inf <^ V (J, 

L i=l 



where 



<if = <x : '^Ii{xi) < ^pilog2, l<k<n 

I. 1=1 i=l 

Thus, though we have another randomness in the setup of the model, the limiting 
free energy remains the same. That is why, in the original setup of GREM by Derrida, 
though (af') may not be an integer, with out any loss of generality one can consider 
the number of branches at the i-th level to be [af]- We make this more precise in 
Remark 13.3. 1[ 

We also note that, in this model the number of configurations in the configuration 
space may not be of the form , where a ia a natural number. Instead, it is of the 
form /1/2 ■ ■ - In- 

In the above model, there are n random variables controlling the number of nodes 
at the n level of the GREM. Since n is fixed and N ^ 00, one may get the impression 
that this extra randomness is not showing up in the final conclusion, namely, in the 
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expression for the free energy. The next model shows that such an impression is not 
correct. 



3.2 Poisson GREM 

In the above model we randomized the tree sequences so that the underlying trees 
once again remain regular. But the general formulation allow us to consider a non- 
regular tree sequence. In the previous model, for each A^, we randomized the tree 
using n many Poisson random variables corresponding to the levels of the tree. But it 
is conceivable to use independent Poisson variables at each of the nodes to construct 
the tree as well as the configuration space. This is what we do now. As in the previous 
model, let {k{l, N), ■ ■ ■ , k{n, N)} be a partition of N. Unlike in that model, now let 
us consider an n-level tree with P(2'^(*'^^) + 1, many nodes below each of the nodes at 
the {i — l)-th level for 1 < i < n. Here P(2^*^*'^)) denotes a Poisson random variable 
with parameter 2^^*'^^ In other words, instead of fixing one random variable and 
taking so many nodes below each of the {i — l)-th level nodes, we now fix one random 
variable for each node of the {i — l)-th level and take so many nodes below that. Let 
us assume all these Poisson random variables are independent. As in the previous 
model, we denote a typical edge at the i-th level by (Xi ■ ■ ■ cTj below the edge cxi ■ • • crj^i 
and we associate independent random variables ^((Ti • ■ ■ Ci) to it. We assume that this 
family {^{(Ji ■ ■ ■ (Ti)} is independent of the above Poisson family. For 1 < z < n, we 
have a positive number Oj denoting the weights for the 2-th level of the tree. Now we 
define the Hamiltonian for the configuration a = (ai, ■ ■ ■ , cr„) as 

n 

HNicr) = ^ ai^(o-i ■■■en)- 

i=l 

This model can be called a true Poisson tree GREM with parameter k = 
{k{l, N), ■ ■ ■ ,k{n,N)). Here we randomize, rather Poissonize, the tree in its full 
form. Even in this case also the model behaves the same way and we get the same 
conclusions as that of the above model, that is, (a) and (b) of Proposition [371. II remain 
true. This is the content of the next proposition. 

Proposition 3.2.1. Gonsider a Poisson tree GREM with parameter k. The following 
is true: 

^) U Yl 2''^^'^^^"'^''^'^''^^ qij\j ■ ■ ■ qij\j < oo, for some i,l < i < n then for a.e. tree 

N>1 

sequence, a.s. eventually, /iAr(A) = 0. 

b) If Yl 2-('=(i'^)+-+'=(^'^))g-j^ ■ ■ ■ g^:^^ < oo, for each i = I,-- - ,n, then for a.e. 

N>1 
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tree sequence the following is true: for any e > 0, a.s. eventually, 

(1 - €)qm ■■■qnN < AiJv(A) < (1 + €)qiN ■ ■ ■ qnN- 

We need the following two inequalities to prove the proposition. 
Let a > 1, b > 1 and A > 0. Suppose that X ~ P(aA) and Y ~ P{bX) are 
independent random variables. Then 



and 



{X + Y + a + by- (a + 6)2 A" ^ 

Both these rely on conditioning. Since X and Y are independent Poisson random 
variables, given X + Y — I, the conditional distribution of X is binomial (l, So 



E 

=E 
=E 



X + a 



X+Y+a+b 

1 a\X + Y + a + by + {X + Y)ab 



{X + Y + a + by (a + by 

2 



a 

<- — - smce a > 1, 



(a + by 



and 



E 



X + a 
{X + Y + a + b)\ 



=E 



1 a(X + y + a + 6) 

(X + Y + a + b)^ (a + b) 

E 



(a + b) (X + Y + a + b) 
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a 1 

<- ttE-— — — — — since a + > 1 

(a + b) {X + Y + 1) 



a 

< 



A(a + 6)2' 

Proof of Proposition \3.2.1\ The proof is routine but involves rather cumbersome no- 
tation. To describe the random tree for the A^-particle system, let Lq ~ p[2^0-'^'>)^ 
the number of edges at the first level. For 1 < cti < -Lq + 1, let L^^ ~ P(^2''^'^'^^), 
the number of edges at the second level below the first level edge cxi. In general, for 

criO-2 ■ ■ ■ O-j, with 1 < (Tl < Lo + 1, 1 < (72 < -^^<7i + 1, 1 < CTi < + 1, 

let -Z^o-i-.-CTi ~ p(^2''^^^^'^^), the number of edges at the {i + l)-th level below the edge 
o"i ■ ■ • o"i_i at the i-th level. 

To prove part (a), it suffices to show, as in Proposition I3.1.H that 

Since, in this model 

BiN = X] ■ ■ ■ X^(-^<71-<7,_1 + 1), 

the proof is immediate. 

To prove (b), as in Proposition 13. 1 . 1| it suffices to show that for each i, 



fi2 



But in this model. 



s2 ^ ^ 



Bj, ^"'^yE..---E..jL^.-<..-.+i) 

To calculate the expectation we proceed as follows. Let J-'o be the a-field generated 
by Lq, T\ be the cr-field generated by {Lq, L,,^ : 1 < cri < Lq + 1} and in general 
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J-'i be the a-field generated by {Lq, L^r^, ■ ■ ■ ,L„^...fj- : 1 < (Xi < + 1, 1 < < 
Lo-i + 1, ■ ■ ■ , 1 < cTj < Lg-j ... o-- .^} for i = 0, ■ ■ ■ , n — 2. Let Ej be the conditional 
expectation given jFj. Then (13.2.1 1) suggests that, 



En-2 ( ) - En-2 ' ' ' E ( 



( Zo-j + i ' " " X]o-„_i -^g-l-'-O-n-l + ^O-^ + l " " " Zo-n-1 ^ 



2 



/ Xlo-i + i ■ ■ ■ X]o-„_2 (-^0-l---0-n-2 + 1) 



Similarly, 



and thus 



Now we can use (I3.2.2I) to calculate further conditional expectations so that 



and so on to get 



EqEi ■ • • E„_2 < 2 



s2 , 



Since E^ ( ^ j = EqEi ■ ■ ■ E„„2 (b^^) ' proof is complete. □ 



Once again to verify the existence of limiting free energy, one has to verify all the 

' ' i,N ' 
N 



steps involved in section 12.41 To be precise, if we assume that Pi (> 0) for 



1 < z < n, the sequence | ^(""i^'""') | satisfies LDP with good rate function Xj for each 
i, then we get 
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Theorem 3.2.2. For almost every tree sequences, almost surely, 

[xi) - iSaiXi) 



where 



1 f " 

lim - logZjv(/3) = log2 - inf <^ V 



^ = < X G M" : ^Ii{xi) < ^paog2, 1 < A; < 



_ — n 

i=l i=l 

Remark 3.2.1. Recall that a tree is regular if for any i, the number of nodes below a 
{i — l)-th level node depends only on i and not on the specific nodes. We say that a 
tree sequence is regular if after some stage each tree in the sequence is regular. Under 

suitable conditions for instance, when ^ ^-Ki^^) ^ some i it is possible 

to show that almost every tree sequence ceases to be regular. Though in this case the 
probability that the tree sequence will consist of regular trees is very small, we did 
not get any further new result except that, now the limiting free energy is constant 
for almost every tree sequences as well as almost every sample points. 



3.3 Multinomial tree GREM 

In the above two models, we randomized the number of nodes at each level keeping 
the average fixed. It is also possible to randomize the vector k suitably. To do that, 

we fix Pi > for 1 < z < n with = 1. Now consider an ra-faced die with pi being 

1 

the chance of face i appearing in a throw. Now we can consider two experiments with 
this die. Firstly, we can consider an indefinite throws of the die and for particle 
system let K{i, N) be the number of times face i appears in first throws. In the 
second, experiment for particle system we will throw the die independently N times 
and observe the outcomes. With the same notation, let K{i, N) be the number of 

n 

times face i appears. Clearly, in both the cases K{i, N) > and ^ K{i, N) = N. We 

i=l 

can consider GREM with parameter K, that is where the under lying tree has 2^^'^'^'^ 
many edges below each of the {i — 1) level node. These can be called multinomial 
tree GREM of first kind and multinomial tree GREM of second kind with parameter 
p = {pi, ■ ■ ■ ,pn) respectively. With the same notation as in section [2731 in this case 
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we have 



TV - . / \ «: / \ N-k / 4 \ N 



ErB,, = E n 2"-'-"' = E 2' ( i ) (E ft) (l - E ft) = + ) ' 

But for any x, as (1 + x)^ < e^^, we have 

i 

ErBiN < e""^"' = 2i^(P^+-+^»\ (3.3.1) 
On the other hand, in this case, = 2^(i'^)+-+^(^'^)22(^((^+i)'^)+-+^(-.^)) and 

n I 

2 E K{j,N) ,2\ -Y:K{j,N) 

B^j = 2 so that = 2 1 . Once again using the fact that 

i i 

N) is binomial with parameters and YlPj^ see that 
1 1 



E2 1 



N 

T,K{j,N) 



2 1 



Hence by same inequahty as earher, we have 

E2-¥'"'''"<e-*?'"=2--fe¥". (3.3.2) 

Combining the above observations (13.3.11) and (13.3.21) . we have the following 

Corollary 3.3.1. Consider a multinomial tree CREM either of first kind or of second 
kind with parameter p. Let A = Ai x • • ■ x A„ he a box in M" and qny = P(^^^^^) G 
A,. 

^) V 2'^^^'^ '"^'"'^'lAT ■ ■ ■ QiN < oo, for some i,l < i < n then for a.e. tree 

N>1 

sequence, a.s. eventually, /iAr(A) = 0. 

U Y 2^2iog2(pi+'"+P*)g-^ . . . q-^ < for gQc/i i = 1, ■ ■ ■ ,n, then for a.e. tree 

N>1 

sequence the following is true: for any e > 0, a.s. eventually. 



(1 - e)qiN ■■■qnN < yU7v(A) < (1 + e)qiN ■ ■ ■ qnN- 
Notice the difference in the hypothesis of (a) and (b) in the above corollary. More 
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specifically, there is a factor ^ extra in the exponent of 2 in part (b). This difference 
will not help us to obtain the exact support of the empirical measure fiN- For the 
variational problem this support is very essential. But, of course, we have strong law 
of large number in our hand. By SLLN, almost surely, j^{K{l, N), ■ ■ ■ , K{n, N)) — > 
(pi, ■ ■ ■ ,Pn) in both the models, first or second kind. In the first case, we use SLLN 
for a sequence of i. i. d. random variables and in the second case we use SLLN for 
array of rowwise independent random variables [27]. According to Hu et al in [27] : // 
{Xnk} be an array of rowwise independent random variables such that EX^k = and 
there exists a random variable X with EX^ < oo so that for all n and k and for all 

n 

t > 0, P(|X„fc| > t) < -P(|^| > t), then \ ^ Xnk almost surely. Since we can 

" k=l 

N 

write K{i,N) as ^ X}^'^ where X^^'j, are Bernoulli with success probability Pi, the 

k=l 

above result is applicable for the second kind model. Thus, in either of the cases, for 
every e > almost every tree sequences after some stage 

N{pi + ■ ■ ■ + Pi - e) < K{1, A^) + ■ ■ ■ + K{i, N) < N{pi + ■ ■ ■ + pi + e) 

for i = 1, - ■ ■ ,n. That is for almost every tree sequences and for any arbitrary e > 0, 
BiN < 2^(Pi+-+P'+^) and '-h < 2-^(pi+-+p»-^) for z = 1, ■ ■ ■ , n. As a consequence, we 
can restate Theorem 12.3.11 as follows 

Corollary 3.3.2. Consider a multinomial tree GREM with parameter p. Let A = 
Al X ■ ■ • X A„ 6e a box m M" and qtN = P{^^^j^) G A^. Let e > 0. 

U Yl 2^^2^^P^^"'^P^'>qiN- ■ ■ ■ QiN < oo, for some i,l < i < n then for a.e. tree 

N>1 

sequence, a.s. eventually, /i7v(A) = 0. 

b) If J2 2^^2-^(Pi+-+P")gfj^ ■ • • qT}^ < oo, for each i = l,--- ,n, then for a.e. tree 

N>1 

sequence the following is true: a.s. eventually, 



(1 - e)qiN ■■■qnN < /i7v(A) < (1 + e)qiN ■ ■ ■ qnN- 

Now the proof of existence of the asymptotic free energy for this model is routine 
and for almost every tree sequence the expression for free energy will be same as that 
of the deterministic tree model where ^ p,i for 1 < i < n. To state precisely, 

let us we assume that the sequence | satisfies LDP with good rate function 

Xj for each i, then we have the following. 



Theorem 3.3.3. With the setup as in the above paragraph, for almost every tree 
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sequence, almost surely, 



where 



1 f " 

lim - logZ^(/3) = log2 - inf <^ V (J,i 
N iV xe* ^ — ^ 

L i=i 



^ = < X e : ^^liixi) < ^pilog2, l<k< 



_ _ n 

1=1 1=1 



Remark 3.3.1. Going back to Theorem 12.3.11 let (T/v) and (T/v) be two sequences of 
trees. Suppose there are numbers C > c > such that for each i, c < < C and 



c < 1^ < C. Then it is easy to see that, hypothesis of Theorem 12.3.1( b) holds for 
(T/v) iff it holds for (T/v). Accordingly, the conclusion of Theorem 12.3.1( b) holds for 
(T/v) iff it holds for (T/v). Same remark applies for Theorem 12.3.1( a). 



3.4 Bolthausen - Kistler GREM 

In 2006, Bolthausen and Kistler proposed a model where they tried to go beyond the 
natural ultrametricity of the GREM model. To recall, a metric d is ultrametric if in the 
metric property one replaces the triangle inequality by d{x, z) < max((i(x, y), dijj, z)). 
In all of the above GREM model one can define a metric on the configuration space 
Stv of the particle system through the covariance structure of the Hamiltonian. To 
be precise for two configurations a and r in S/v, 

d{a, t) = ^/E{HN{a)-Hr,{T)y. (3.4.1) 



Here as usual, Hn{(t) = ^ ai^(cri ■ ■ - cTj) with the usual GREM notation. Let a = 

i=l 

(cTi, ■ ■ ■ , CT„) and r = (ri, ■ ■ ■ , r„). If cr = r then d{a, r) = 0. If (Xj = for 1 < i < 

r-n 

k < n but o-(k+i) 7^ T{k+i) then d{a,T) = w2 X] cilE^{ai ■ ■ -CTj)^, assuming that the 

y i=k+i 

(^'s are symmetric with finite variance. The distance between any two configuration 
will be maximum, when they differ at the first level of the tree. The longer the initial 
segment of a and r coincide, the closer they are. Also it is quite easy to verify that 
this metric indeed is an ultrametric. To see this, it is enough to see the level of 
difference among the configurations. Suppose, we have any three configurations cr, r 
and ?7 in S/v. Let fcj, k2 and k^ be the maximum non-negative integers so that cTj = 
ioi 1 < i < ki\ ai = rji for 1 < i < ^2 and Ti = rji for 1 < i < k^ respectively. 
To show (i(cr, r) < max((i((T, 77), (i(?7, r)), we only need to show that ki > min(/c2,^3)- 
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Without loss of generality if we assume ^2 < then a = rji ■ ■ -rfk^cyk^i+i ■ ■ • o"n and 
r = r]i-- -rik^Tk^+i ■ ■ - r^. If ki < k2, then cTfc^+i = r^^^+i = r^^+i will contradict the 
maximality of ki. 

We will denote the model of Bolthausen and Kistler as BK-GREM. The set up 

in the BK-GREM is the following: For a fixed number n e N, they consider the 

set / = {1,2, ■■■ ,n} and a collection of non-negative real numbers {aj}jci such 

that ^ aj = 1 with a0 = 0. There may be subsets J of I for which aj = 0, 
Jc/ 

so they consider Vi as that collection of subsets J of / for which aj > that is, 
Vj = {J : aj > 0}. Like in the usual GREM, they fix n positive real numbers 7^ 

n 

for i G / so that ^ 7^ = 1 and split the configurations space Sat = { — 1, 1}^ in to 

i=l 

products S^-^AT X E^jAT x ■ ■ ■ x S^„Ar with S^^at = {—1, l}"^'^ for each i. Since 'jiN 
may not be an integer one needs to use [jiN] instead of 'jiN, and YYi=i ^biN] as 
the configuration space etc. Since we shall soon reformulate this model we do not 
elaborate on these points. So a configuration a can be written as (cti, ■ ■ ■ ,cr„,). For 

k 

J = {ji, ■ ■ ■ ,jk} C /, denote Sj^r for Y[ ^-jjjN and ctj for the projected configuration 
{aj)j(zj G SjAT. In this setup, the random Hamiltonian is defined as 

Je-Pj 

where for J eVj and aj G Sj^a^ the random variables ij{crj) are independent centered 
Gaussian random variables with variance ajN . It is quite easy to verify that, in this 
model if we define the metric on the configurations space by the same formula as 
in (13.4. ip then the metric will not be always an ultrametric. But yet they have 
shown that the limiting free energy is again a GREM free energy. To be precise, 
define a chain {Aq, Ai, - ■ ■ , A^) to be an increasing sequence of subsets of / with 
= Aq ^ Ai (Z ■ ■ ■ <Z Ak = I. What they have shown is that for any BK-GREM 
there exists a chain (Aq, Ai, ■ ■ ■ , A^) and positive constants for 1 < i < k with 

k 

^ ttj = 1 such that the following holds: the free energy of this BK-GREM is same as 

i=l 

that of a Gaussian k level GREM energy where random variables at the i-th level have 
variance a^iV. This means, once again the limiting free energy does not go beyond the 
GREM one. We will present here an alternative and elegant proof of this thanks to 
large deviation results. To do that we will reformulate the model in the next section. 

3.4.1 Reformulation 

We formulate BK-GREM as follows. Fix a set / = {1,2, ■ ■ ■ , n} with n>l. Let 
iV > n be the number of particles, each of which can have two states/spins +1, — 1; 
so that the configuration space is Eat = 2^. Consider a partition of into integers 
k{i, N),l < i < n with each k{i, N) > 1 and Yl N) = N. We will as usual think 
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of 2^ as Hie/ S'^^*'^) and a G 2^ is (Ti ■ ■ ■ (t„ where G 2'=(*'^). Let S be the collection 
of non-empty subset of I. For each element s in we denote 2^"'^ = JJ^ 2^^'^'^\ With 

this notation 2^ = 2^"^ . The map a G 2^-^^ — cr(s) G 2^^^ is the projection map 
via s. For s = {ii, ■ ■ , ik] ^ 5' where ii < 12 < ■ ■ ■ < ik and cr = di ■ ■ ■ (T„ G 2^-^^, 
we denote a{s) = ai^ai^ ' ' '^ik ^ 2^"^, the projection of a via s. Now for fixed A^, we 
have a bunch of independent random variables ^(s, cr{s)) as s varies over S and (t(s) 
varies over 2'^''^. 

For each a G 2^ one can think of a lattice isomorphic to the lattice of power set of / 
where cr(s) corresponds to the edge of the lattice joining the nodes s = {ii, 12, - ■ ■ , ik} 
and {ii,i2, ■ ■ ■ , ik-i}- Now for each a associate random variables ^(s, cr(s)) to each of 
the lattice edge <7{s). We associate weights > to each edges ^(s). These are not 
random. In a configuration a = (Ti ■ ■ ■ (j„ the Hamiltonian is defined as 

if^(a) = iV^a,e(s,a(s)). 
For /3 > the partition function is 

Here E^- stands for expectation with respect to cr when 2^ has uniform distribution. 
In other words, E^- is simply the usual average over a. 

Since ^'s are random variables both Hjsr and Z^r are random variables. We suppress 
the parameter u that comes with the random variables ^. As usual log Zjv(/5) is 
the free energy of the A^-particle system. As A^ changes, the distribution of the ^'s 
would in general depends on A^. So strictly speaking we should be using superscript 
A^ for the random variables. But for ease in reading we suppress the superscript. This 
should be borne in mind. We assume that all our random variables are defined on 
one probability space. 

3.4.2 LDP Approach 

In this subsection, we outline how large deviation principle can be used. Since we 
will prove a more general result in the next chapter (see section [^^3!) . we refrain from 
giving complete details. Let us consider the map E^v — ^ M'^ (recall S is the collection 
of non-empty subsets of /) defined by 

(^^^a = (^(s,cr(s))),65. 

Let fiN he the induced probabihty on M.^ when T,^ has uniform distribution, that 
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is, each a G Sat has probabihty In other words, for any Borel set A C M"^, 



= : e A}. 



In particular, if A is a box, say A = Hsgs with each As C M, then 

o- S&S 

Here now is the basic observation similar to that of Theorem I2.3.1I 

Theorem 3.4.1. Let A = Il.es ^ Denote q^N = P{^{s,a{s)) E A,) for 

s E S. For t E S we denote Ylsct IsN by QtN and Yliet ^(^5 ^) by Kin ■ 

a) If Yli '^^"^QtN < 00, for some t E S then a.s. eventually, yUAr(A) = 0. 

Af>l 

b) If for all t E S, Yl '^~^^^QtN ^ then for any e > a.s. eventually, 

N>1 

(1 - e)E/i^(A) < /i^(A) < (1 + e)E/i;v(A). 

There is no new idea, we need to verify that arguments of the previous chapter go 
through. 

Proof a) Let t be such that "^^"^"^QtN < 00. Then 

Af>l 

MA) = ^EniA.(e(^,a(.))) 

^ cr ses 
oN-KtN 

^ (7(t) sCt 

^ EniA.(e(s,o^(^))) = G^,(say). 



^ a{t) sCt 



Let Aj^ = {Gj\f = 0}. Observe that 



(7(t) sCt 
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Now by Chebyshev's inequality, 

a{i) sCt 

Thus by assumption and Borel-Cantelli, An will occur a.s. eventually, i.e. Gn — 
and hence Hn{^) = 0. 
b) 



Var(//^(A)) 
-E(/.^,(A))2-(E/x^(A))2 



22N 



-y 

2N 



2 

IN 



ses 



E E 

t&S a(t)=T{t) 



E n a{s))) n iA.(e(5, r(.))) - g 



2 

/AT 



-2- Eg,/ 



te5 



Hence for any e > 0, by Chebyshev's inequality 



P(|/.^r(A) - E/.^(A)| > eE/.jv(A)) < 1 J] 



But, in view of the assumption, the sum over N of the right side is finite. So by 
Borel-Cantelli lemma, a.s. eventually. 



(1 - e)E//iv(A) < //Ar(A) < (1 + e)E//^(A). 
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□ 

Suppose that for each element s G S", we have a sequence of probabihties {A^ : 
N > n} obeying LDP with a good convex rate function Xs{x). We now consider 
reformulated BK-model where each ^{s,a{s)) has distribution A^. Thus for fixed N, 
we have a bunch of independent random variables ^(s, cr(s)) as s and a{s) vary. For 
example, for the N particle system, one can consider for each s E S; C,{s,a{s)) to be 
i.i.d. having density 

Mx) = f^)"^ e-^^-^ -oo<x<oo. (3.4.2) 

Let us denote 

^ = {J G : ^X,{xs) < ^p^hg2, Vt G S} (3.4.3) 

sct iet 

and the map J -.R^ ^R, defined by, 

Jix) = ZM^s) if£G^ 

ses 

= oo otherwise. 

Then with the help of Theorem 13.4.11 one can mimic the steps in Theorem 12.5.11 
to get 

Theorem 3.4.2. In the reformulated BK-GREM, let ^ > as N oo for 

1 < i < n. Then almost surely, the sequence {fJ,N, N > 1} satisfies LDP with rate 
function J defined above. 

In this way, once again Varadhan's lemma will ensure the existence of the limit- 
ing free energy in this case also. Thus though we don't have ultrametricity on the 
configuration space, the simple LDP technique works. 

Theorem 3.4.3. In reformulated BK-GREM, almost surely 

lim ^ logZiyiP) = log 2 - inf {PusXs +Is{xs)) . 

Remark 3.4.1. One may feel that the reformulation of BK-GREM is not exactly similar 
to the original version of Bolthausen and Kistler. They consider only those subsets s 
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of I for which 7^ whereas in the above reformulation all the non-empty subsets 
are considered. But from the above theorem it is easy to check that in calculating 
infimum, X, being non-negative and X,(0) = 0, the terms corresponding to those s G S* 

for which = will not contribute. 

In [3], Bolthausen and Kistler identified the free energy of this model as minimum 
of several GREMs associated with, what they call, chains. We shall show that there 
are n\ many n level usual GREMs hidden in the above model. The method used above 
also identifies the free energy of the BK-GREM as the minimum of the free energies 
of these n! GREMs. This is what we do in the next section. 



In this section, we consider the BK-GREM, that is, we take 5* to be the set of all 
increasing sequences of elements of / with the Gaussian driving distributions. As 
mentioned earlier, this is nothing but the Bolthausen-Kistler's model since such se- 
quences correspond to non-empty subsets of /. Suppose now for s G 5* the associated 
weight is a^. Here we evaluate the explicit expression for the limiting free energy of 
BK-GREM. Though it is possible to consider different driving distributions for each 
s G S', a general closed form expression appears to be difficult. Of course, we could 
also start with some more general driving distributions than Gaussian, like distribu- 
tion having density (p as in fl3.4.2l) with 7 > 1 at all the levels. Since in that case, 
there is no new idea needed, we restrict ourselves to Gaussian case for notational 
simplicity. It is worth mentioning here that, in [3], Bolthausen and Kistler evaluate 
the expression of the limiting free energy in two steps. In the first step, they define 
a chain as a sequence of strictly increasing sequences of subsets {Aq, Ai, - ■ ■ , A^) of 
/ so that = Aq G Ai G ■ ■ ■ G A^ = I- For such a chain they associated a K level 
GREM with appropriate weights calculated from the weights of the original model. 
Then by second moment estimates, they have shown that the limiting free energy of 
each such GREM associated to a chain is an almost sure upper bound for the limiting 
free energy of their model. In the second step, they constructed a chain in which the 
free energy of the BK-GREM is attained. 

Here we get the expression for the limiting free energy by calculating 



3.5 Hidden Tree GREMs 




(3.5.1) 



where 



^ = G M'^ : < ^2pilog2, Vs G S}. 



tCs i£s 



Note that \I' is same as that of f l3.4.3p with Is{xs) 



\xl. As earlier, the above 
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infimum is same as 



(3.5.2) 



where 

^+ = {^GM^: < ^2pilog2 &x, > 0, Vs G ^}. (3.5.3) 

tCs i&s 

To evaluate (13.5.2p . we start with some notations. Here the ideas are very much similar 
to that of Bolthausen and Kistler. A new idea is the introduction of permutations of 
{1, 2, ■ ■ ■ ,n} justifying the title of this section. For AC/, let us define 

Pa = ^Pi 

and 



sQA 

ses 



with wl, = 0. Let Vq = Vj denote the set of permutations of /. With this notation, 
for n eVi and < i < j < n, denote 



^Mi),-,^(i)} 

where, for i = the set {vr(l), ■ ■ ■ ,vr(z)} that appears in the denominator is treated 
as the empty set. 

Note that earlier to evaluate the explicit energy expression for 7-GREM with 
7 > 1, in subsection 12.6.11 we consider only one triangular array of numbers defined 
as B{j, k) in fl2.6.4p for 1 < j < k < n. Now here we are considering n! many 
triangular arrays corresponding to each permutation vr. 

Now let, 

Pi = min min BL = min_Bn,-. (3.5.5) 

Also note that, in subsection 12.6.11 we define jSi in (12.6.51) as the minimum over 
all the entries in the first row of the triangular array B{j, k) in (12.6.41) . Here we are 
defining jSi as the minimum over all the entries in the first rows of all the n\ triangular 
arrays. 

It may be further noted that in subsection 12.6.11 to define (3i if the minimum 
occurred at two places, we had taken the maximum index (see (I2.6.4p for the definition 
of ri). We now implement the same plan in the present setting also. Since the 
minimum may be attained in the first lines of two different triangular arrays, one 
needs to know what is meant by maximum index. This will be done now. 

Suppose the minimum occurs at two places, say at {9,k) and {g,l), that is, /?i = 
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= B^^. Let G = {6(1), ■ ■ ■ , e{k)}; H = ■ ■ ■ , Let \GUH\=m and let 

Vi denote the class of all permutations of / for which {vr(l), ■ ■ ■ , 7r(m)} = GU H. So 
Vi C Pi. 

Claim: fij„^ = rf^f^^^}^^ = jor every n eVi. 
To justify the claim, first of all note that 



whereas, 



Then for tt G Pi, 



Vgvjh = Pg + Ph - PcnH- 



<2{pg + Ph- Pghh) log 2 - (i^G + - ^Gn/f) 

= (2pG log 2 - Plwl) + {2pH log 2 - /J^u;!,) + - pGnn) 

as /3i = Bqi^ = Bqi, first two terms are zero, 

= Pl^GHH - PcnH 
< 0. 

The last inequality follows from the fact that (3i is obtained by taking the minimum 
over all possible choice of (vr,j). This shows (3f > ^pgu^l12e1_ 

Once again jSi being the minimum over all possible choice of (7r,j), we conclude 
that /5? actually equals '^'^^^^ ^ proving the claim. 

If the minimum in (13.5.51) occurs at more than two places, we can use induction to 
conclude that there exists a unique maximal set, say, Gi C J such that the following 
holds. Let |Gi| = li and Vi = all permutations that map {1, 2, ■•• /i} on to Gi. 
Then for any vr G Pi, B^i^ = 

It may happen that Gi = I, then we will stop. Otherwise, let us define 

do = min min BT , = min BT , . 

Of course, the last minimum is only over tt G Pi. 

Once again going back to subsection 12.6.11 to define j32 in fl2.6.5p . we look only the 
entries from the ri + 1-th row of the triangular array B{j, k) in (12.6.41) . Here, we are 
looking the entries of li + 1-th rows (as corresponds the first row) of the triangular 
arrays corresponding to each tt G Pi. 
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If possible, suppose the minimum occurs at two places, say, at {9, k) and at {g, I). 
So 9,0 e Pi; h < k,l < n and p2 = B^^ = 5f^,. Let G = {^(l),-- - ,9{k)}; 
H = {g{l), ■ ■ ■ , s-'^d G2 = G U H. Let IG2I = h and denote by V2, the class 
of all permutations of Vi for which {7r(l), • • • ,7r(/2)} = ^2- Since tt in Vi already 
maps {1, 2, ■ ■ ■ ,li} onto Gi, this extra condition only means that tt moreover maps 
{/i + 1, • • • , I2} onto G2 - Gi. Clearly, V2 C Vi. 

Claim: 5/;^^ = J — 3 — = /32; /or every tt G ^2- 

The justification of this claim is similar to that of the earlier one. Once again note 
that, 

but 

PG2-G1 — PG-Gi +PH-Gi — PGCiH-Gi- 

So for TT e P2, we have 



2PG2-Gi log 2 - Pi {wl^ - wlj 

< 2 (PG-Gi + PH-Gi - PGnH-G,) log 2 - /3| (w^ + - w^n^ - W^J 
= 2 (pg-Gi + Ph-Gi - PGnH-Gi) log 2- 

= (2pG-Gi log2 - (51 {wl - wl^)) + {2pH-G, log2 - (51 {wjj - w^^)) + 

{Pi i'^GHH -^k)- '^PGHH-G, log 2) 

= Pi {wonH - f^Gi) - '^PGnH-G, log 2 

< 0. 



2 ^ 2pG2-Gilog2 



Hence (^2 > — 5 2 — once again /32 being the minimum over all possible 



«^G2 - ^Gi 

choice of TT e 7^1 and h < j < n, the only possibility left is the equality. That is 
P2 = ^'^n '^^ and hence the claim is proved. 

^G2 - ^Gi 

If the minimum occurs at more than two places, we can use induction to conclude 
that there exists a unique maximal set, say, G21 such that Gi (Z G2 ^ I and the 
following holds. Let \G2\ = I2 and V2 be all permutations of Vj that map {/i + 
1, • • • ,^2} onto G2 - Gi. Then B^i^iGi) = p2 for all tt e V2. Of course, all the 
quantity I2, P2 depend on Gi. 

Proceeding by induction can summarize: 

There is a (unique) integer K with 1 < K < n and for every i with 1 < i < K 
there are Pi, k, Gi and Vi satisfying the following: 

1. = Go C Gi C • • • C = / with IGJ = k so that 1 < /i < /2 < • • • < = 
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2. Vi is the set of permutations tt of / that maps {1, 2, ■ ■ ■ onto Gj for each 
j <iso that ■■■ dVk- 



3. Pi = Bf_^i^ for every n E Vi and this common value is also same as 
min min B7 ■. 

So note that for any tt G Vk, we can trace out the f3i for 1 < i < i^, as 



2(p^(;,_i+i) + ---+p^(z,))log2 



(3.5.6) 



Moreover, the infimum in (13.5.21) reduces to the following: 



and 



^ = {x: Xl;< 2pclog2, V(0 ^)C C /} 
where we used the notation 



(3.5.7) 



(3.5.8) 



scc,se5 



Now we prove that, if (3j < [5 < (3j+i, the above infimum is attained at x* 
{x*;seS)e given by 



Pitts ii s C Gi 

^20.3 if s C G2, s <^Gi 



Pjttg if s C Gj, s ^ Gj^i 
[Ptts iis^Gj. 



First of all note that x* E For, G ^ I implies 



X. 



c 



scc,ses 



X, 
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{j+l)/\K 

E E 

sCC,seS 



X 



*2 



E E + io-M<^} E 

scc,ses 



*=1 sCGi,s^Gf_i 
sCC,se5 



E 1 



E "l+.'-S 

sCC,s6S 



sCC,seS 



< 



sCC,seS 



E E 



^G, - ^'G.-i 



scc,ses 



2p(GuG,-i)-G,-il0g2 ^2 , T 
2 + Hj+1<K} 



«=1 sCGi,s^Gi_i t.UG,_i G,_i 
sCC,s65 



E 

s^Gj 

scc,ses 



2P(GUG,)-G,l0g2 ^2 



< E 2P(GuGi_i)-Gi_i log 2 + l{j+i<i^}2p(cuG^)-G,- log 2 
= 2pclog2. 

Secondly, note that for any x E we have 



EE"' 

i=l sCGi,s^Gi-i 



^)>E E ^:(^:-^^)- 

»=1 sCGi,s^Gi-i 



For, by Holder's inequahty we have 



E E ^ , 

j=l sCGi,s^Gi-i \ 



XZ 



E E . 

i=l sCGi,s^Gi_i \ 



E E -^^E E 

j=l sCG<,s^Gi_i «=1 sQGi,s<^Gi-i 



.*2 



where the last inequality follows from the fact that x e ^. Hence 



E E <{<-^s)>^. 
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Since /3 > Pj, we have ^^ — 1^ >Oforl<i<j. Moreover /3j being increasing in i, 
these numbers (^j: — are decreasing and hence we get, 

i=l V^' / sCGi,s^Gi-i 

In other words using the definition of x* we get the observation 



Now by using the above inequahty, we have 

sCGj sCGj 

= E {lxl-Pas{xs-x*)-lx^^) 

sCG, 
sCGj 

= 1 E (^.-^:)'>o. 

sCGj 

Moreover, using the definition of x* for s <^Gj, we have 

E - pasXs) - E - /3a,a;*) 

= E {\xl-(5a,xs + \(5^af) 

= I E - z^^:.)' > 0. 



Thus combining the above two inequahty. 



and hence the infimum occurs at x* . 

Denote /3o = and I3k+i — oo. Suppose 1 < j < K and /3 e then the 
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infimum in (13.5.71) becomes 

= E a^-/5EA E a's-lP'Eal 

= PG,iog2-/?x:A E «s-|/3'E«s 

We can summarize the above discussion in the following 
Theorem 3.5.1. In the Gaussian BK-GREM, almost surely, 

hm 1 log Z;v(/3) = 5^ p.log 2 + /3 ^ A E + E 

z//5e[/5,,/?,+i)/orO<j<K. 

We shall now describe for each vr G P/ an n level GREM. In what follows vr G is 
fixed. For the particle system there are 2'^'*^'^*^*^'^^ furcations at the iih. level, below 
each node of the {i — l)th level. The weights at the i-ih level in this GREM are ^(vr, i) 
which are defined by ^(vr, 1) = 07^(1), and in general, for 1 <i <n 

s^{7r(l),-,7r(i-l)} 

Let S{n,(3) be the almost sure limiting free energy of this GREM. This exists by 
Theorem 12.4.61 As done in subsection 12.6.11 we set = and let 

= min , 

with = max{/ > Vi^i : B{rJ_-^^, I) = (3i} for 1 < z < K'^ with r^^ = n. Also denote 
Pq = and Pxn^i = 00. Then by Theorem 12.6. H we have for P G [PJ,PJ^i) with 
< J < K^, 



n 1 " ' 

£{7i,P)= J2 Pni^)log2 + -P^ Yl w\n,i)+pJ2P>'i^^^)- (3-5.10) 

i=rj+l i=rj+l 1=1 

Now let us consider vr G Vk- Then note that U = rj for all 1 < i < -ft' and by 
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definition (3^ is same as (3^ . Hence ^ ]?7r(i) = ^ Pi! ^ "^s ~ 'w'^{n,i) and 

n 

E «H E ^'^(Tr, 0, so that £{P) = £:(7r, P). 

Thus for every n G Px, the GREM associated in the above paragraph has the 
same energy, namely, S{/3), the energy of the BK-GREM. 

We now go on to show that if tt is any permutation then the energy of the GREM 
associated with tt, namely S{n,j3), is larger than £{f3). So fix a permutation tt. 

Denote H[ = {s : s C {7r(l), ■ ■ ■ , n{i)}Ls ^ {7r(l), ■■■ 1)}}, that is, 

consists of all subsets of {7r(l), • • • , 7r(i)} that include 7r(i). Then 



n 

E E 

( 



«=1 se5,sC{7r(l),--- ,7r(i)} 
5^{7r(l),-,7r(i-l)} 



1 2_/o 



sE 

j=i 



E 



sC{7r(l),--- ,7r(j)} 
s^{7r(l),-,7r(i-l)} 



2 ^ 



E 



sC{7r(l),-- ,7r(i)} 



E 



sC{7r(l),-- ,7r(i)} 



ys^{7r(l),-,7r(i-l)} y ys^{7r(l),-,7r(i-l)} y j 



since for C I, E "^s^^s < E 



E 



wcXc, 



ses,scc 



ses,scc 



^ses,scc 



=E 

i=l 



- (5w{'K,i)XHf ) . 



Moreover, for n eVi, let us denote 



T = {^{1(1),- Mi)} - .'^W} log 2, 1 < i < n} 

k k ') 

E^lf. < E2p.(i)log2, Vl<A;<nlcM^. 

i=l ' i=l J 



Then ^ C for every tt e P/. Hence for every tt e P/, we have 



S^S 2—1 
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and hence 

S{P) =log2-inf E (K-/?a,a;,) 

n 

< log 2 - inf ^ \Xl. - PPw{n, i)XH^ = S{Tr, p). 

Thus we have proved the following. 
Theorem 3.5.2. Almost surely, 

TV 

That is. the free energy of the Gaussian BK-GREM represents the free energy of 
an n level tree GREM. In fact, it represents the minimum out of all possible n\ many 
n-level Gaussian tree GREM energies with appropriately defined weights. 

Remark 3.5.1. A closer look of the definition of BJ^ reveals that if = for 
some s & S then such an s plays no role in the definition of /3iS. Moreover, since 

Eses (i^? - f^^sXs) = Ese5,a,^o (l^s - PasXs) + E,,e.9,a,=o 1^1 in calculating infi- 
mum of {^xl — PagXg) on ip we will be quite justified to put Xg — for all 

those s e 5" for which = 0. This will lead to the calculation of infimum of 
"^seSasf^o (i^s ~ PcLsXs) ou ^. In other words, we could consider 5" to be the col- 
lection of all those increasing sequences s for which ^ 0, instead of all sequences. 
This is the setup of the original Bolthausen-Kistler model. 

Bolthausen and Kistler have shown that the free energy is the minimum among 
the free energies of the tree GREMs associated with the all possible increasing chains 
of subsets of /. What the above argument shows is that one need not consider all 
chains. It is enough to consider n\ many n level GREMs. Can we reduce n\l Perhaps 
not in general. Incidentally, the argument also identifies all these n level GREMs 
which attains the minimum. In fact, the number of such n level GREM is precisely 
IVk], cardinality of Vk- 

Remark 3.5.2. Though the BK-model is not a hierarchial model, yet when the driving 
distribution is Gaussian we are not able to get out of the tree GREM. That is, the 
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tree GREM is in some way hidden in this model. 

Now one can raise the question whether going out of Gaussian driving distributions 
leads to a BK-GREM that is not usual tree GREM (in the sense of energy). In this 
regard, it is worth mentioning, that if we consider that the driving distributions {/U^} n 
satisfying LDP with rate function 2s{xs) = ^xj for some 7 > 1 and every s E S then 
Theorem 13.5.21 remains true. To see this we follow the same line of proof as above 
with the appropriate changes as done in section 2.6. 

Remark 3.5.3. Large deviation approach allows us to consider different driving distri- 
butions for different s G S". This can be done with BK-GREM also and one can prove 
the existence of free energy. But it is not easy to obtain explicit formula. 



3.6 Block Tree GREM 

In the previous section we have shown that in the Gaussian BK- Model the limiting free 
energy is the minimum over all possible n\ many n-level tree GREMs with appropriate 
weights. Now we will conclude this chapter by exhibiting one model which includes 
again n! many n-level GREMs and where the free energy is maximum over all those 
GREMs. To define the model we will use the notation n,N,a = o"i ■ ■ ■ (T„ with the 
same interpretation as that of the earlier section. Let be given non-negative 

weights. For any sequence s = (ji, ■ ■ ■ of distinct elements from / = {1,2,-- - ,n} 
and for any cr(s) = (0"^^, ■ ■ ■ , aj.) G 2'^*^-'^'^) x ■ ■ ■ x 2^^^^'^\ We have random variables 
^^(.^^ and these are independent A/'(0,A^). Now depending on vr, a permutation of / 
and a G 2^, we define the Hamiltonian as 

n 

i=l 

Note that here the configuration space has n\ x 2^ many points instead of usual 
2^ many. We call this model as Block tree GREM. We define the partition function 
corresponding to inverse temperature /3 > as 

and the definition of free energy is j^\ogZ]^{[3). 

So for = 3 the model will look like as in Figure 13. 1[ 



107 



Block Tree GREM 




Figure 3.1: Block Tree GREM 



Now for each tt e P/, let us denote 

Note that for each tt, ZJf denote the partition function for the n-level tree GREM 
with 2'^*^'^*^*^'^'' furcations below each of node at the {i — l)-th level of the tree and with 
the associated weight in the i-th level being ajr(i). So we can write 

Hence ^ log = ^ log E.^P, = ^ log max Z^P) + ^ log E.^P, ^^Sfk" 

Since limiting free energy exists almost surely corresponding to every tt, let us 
denote = lim^v ^ log ZJf{(3). Hence 

1 1 
lim — log Zn{(3) = lim — log max ZJf{(3) . 

N fs AT iv TT 

Now log being increasing function we can bring the max out side log and the range 
of TT being finite we can push the limit after max so that 

limi-logZ^(/3) = maxlimi-logZ]^(/3) = ma.x£^{P). 

N J\ IT N J\ TT 

Thus we have the following 
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Theorem 3.6.1. In the block tree GREM, the limiting free energy £[(3) exists almost 
surely and 

8 {(3) = maxf"(/3). 

TV 

Remark 3.6.1. In the definition of weights, we fixed numbers ai, ■ ■ ■ , a„ and weighted 
^^ia)'f^^-!(2)-^-f^l(i) with a,r(j)- Instead one could fix for each s, a sequence of distinct 
elements of I, a number and then ^al^lpll^r^alf^^^ could be weighted with a{7r(i),... ,7r(i)}- 
Different driving distributions for different s can also be considered. Then also the 
above theorem remains true. 

Remark 3.6.2. We now consider the weights (a^, s C J) as mentioned in the above re- 
mark. Using the notation of previous section, consider BK-GREM with these weights. 
Consider the GREM associated with vr G in the BK-GREM and denotes its energy 
by^(vr,/3). 

On the other hand, consider block tree GREM as mentioned in the above theorem 
with weights a{7r(i),... ,7r{j)}, where 



-^2 _ 2 

iMi),-,^W} - 2^ 

sC{7r(l),--- ,7r{i)} 
s^{7r{l),--- ,7r(i-l)} 



By fl3.5.9p . we observe that for each vr G Vi, a{7r(i),--- ,7r(i)} = w{TT,i) for 1 < i < n. 
Moreover, for a fixed n E Vi,m both the associated GREM model have 2'^*-'^*-*^'^-' many 
edges at the i-th level below each node of the (i — l)-th level. Hence = S{7i, P) 

for each vr G Vj. Thus the limiting free energy of this block tree GREM is larger than 
that of the BK-GREM. 
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Word GREM with External Field 



In this concluding chapter we discuss a more general version of random energy models, 
called Word GREM. This model includes Derrida's REM and GREM, also the model 
of Bolthausen and Kistler. Moreover the model is considered with external field. We 
apply this analysis to analyze the free energy of REM with external field. 



4.1 Word GREM 

In the previous chapters we have shown that the almost sure existence of the limiting 
free energy is assured through the simple LDP of certain empirical measures. This 
techniques is quite simple and neat. In this section, we present a general setup which 
includes all the models mentioned above. However, it is not just the generalization 
that should be noted. More importantly, we use the same large deviation technique 
which allows us to introduce external field in the model. To our knowledge these 
models are so far not discussed with external field except the REM by Derrida in 
[T6] . Not only that, as already mentioned in the previous chapter this method allows 
consideration of different driving distributions. This in turn leads to diverse covariance 
structures for the Hamiltonian. 



4.2 The Model 

Let / = {^1, <^2, ■ ■ ■ , ^n} be a set of n symbols where n > 1 is a positive integer. Let 
S{I) be the set of all words formed by these n symbols. Let S" be a finite subset of 
S{I). So a typical word s G S" of length / will look hke s = <^j^Q2 ■ ■ ■ Q, where each 
G /. Occasionally we will use the symbol s G S* as a word as well as a subset of 
/ consisting of all the symbols in s. Since symbols may be repeated in a word, it is 
possible that two different words may correspond to the same subset of /. Moreover, 
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without loss of generality we assume that each symbol appears in at least one word 
of S, that is IJ s = /. 

s&S 

For N > n, the N particle system has configuration space, as usual, Ejv = 
{+1,-1}^ consisting of sequence of length N with entries +1 and —1. For 1 <i <n, 

let k{i, N) >lhe integers with k{i, N) ^ N and ^ > as ^ oo. 

i=l 

Clearly, Ei Pi = 1- 

For a = ((7i, • • • , (Tat) G Sat, we denote — {ai : i < k{l, N)) , — 

((Tj : A^) + 1 < i < A^) + A;(2, iV)), etc. Thus o" can also be written as o" = 
((7"'^, ■ ■ ■ , cr"). For each s = QjQa ' ' ' ^ -S* and cr = {a^,- ■ ■ ,a"'), we put, 

n 

1=1 

For each s G 5 and a G S]v we have a random variables ^(s,ct(s)). These are 
assumed to be independent random variables (distributions in general depend on N.) 
To make it more precise, denote Sjjv = {+1, — l}'^'-'''^-', for 1 < i < n. For each 
s = ' ' '% ^ ^ "^("5) — {^^^ > ■ ■ ■ > "^'O ^ ^iiAf X ■ ■ ■ X Sjjjv, we have one random 
variable All these ^ 2'^(*'-^) random variables are independent. Let us 

assume, for s G 5, all the ^(s, cr(s)) have distribution on M, that is, the distribution 
of ^(s, (t{s)) depends on s but not on a{s). Let / : R'^ ^ R be a continuous function. 
For the configuration a — (iTi, (72, • • • , (Tn)-i the Hamiltonian of the system is defined 
as 

N 

H^ia, h) = Nfiiia)) + /i J] a^, (4.2.1) 

i=l 

where ^(cr) = (^(s, ^"(s)))^^^ and /i > is a number representing the intensity of the 
external field. The partition function of the system is 

a 

with /3 > being the inverse temperature. Once again the limiting free energy is 
lim^logZjv(/5). 

Remark 4.2.1. Observe that if S* = 6*1 consists of only one word ?i?2"""^rn and 
if f{x) — X then this is just the REM. If 5" = 5'„ consists of the n words 

{^1,^1^2,- •• ,^1^2 •••^n}, and if fiixs)s) = then this is just GREM. On 

seS 

the other hand, if S* = 5*21 consists of all the words Q1Q2 ' ' "^ii "with out repeti- 
tion of symbols then S can clearly be identified as the collection of non-empty sub- 
sets of S. If moreover, f{{xs)s) — ^ a-sXg then this will lead to the BK-GREM. 

s&S 

Of course, one could also take S — Sn\ consisting of all the n\.n many words 
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{^7r(i)^7r(2) ■ ■ ■ ?"7r(0 : 1 < / < & TT a permutatioii of {1, 2, ■ ■ ■ , n}}. 

Let a* denote the sum of the k{i,N) many +1 and —1 appearing in a*. In other 
words, if cr = (cti, (T2, ■ ■ ■ , (Tm) then a* is the sum of all aj where j satisfies k{l, N) + 
■ ■ ■ + k{i - 1, N) + 1 < j < k{l, N) + --- k{i, N). Then, note from fETD that 

i=l 

and 

Z^(/3,/.)=2^E.e-^'^^, (4.2.3) 

where E^r is the expectation with respect to the uniform probability on the configu- 
ration space. 

Under certain assumptions we shall show that the limit lim log 2'iv(/5) exists 

almost surely and is a non-random quantity. The essential assumptions are the fol- 
lowing: Firstly the distributions of ^ should have exponential decay and secondly 

k(i,N) 

' converges. 



Notations 

We start with some notations which we will use in the rest of the chapter. A typical 
points in M.^ x M" will be denoted by {{xs,s G S), {yi, i < n)) or simply as {Xg,yj). In 
what follows, □ = rises x YYi=i Vj is a box in x M" where for each s E S 
and Vj for i < are open subintervals of M. 

For A C I, let Sa = {s e S : s C A}. So note that Sj = S. We will denote 
Qan = IlseSA where Qsn = A^(As). If Sa is empty for some A, we put Qan = 1- 
Also we will denote Qsn with the same understanding as above considering s as a 
subset of /. Strictly speaking we should denote Qan and Qsn as Qan(0) and qsNi^s) 
respectively, but for ease of writing we are not doing so. If A = {^j^, <^j2, ■ ■ ■ 
then sometimes we need only the indices {ii, ^2, ■ ■ ■ ? "^m} and we will denote them by 

n 

[A]. For A C I, we denote k{A,N) = E ^(^> ^)l{fteA} = E Kh^) and Oan = 

i=l ' i(i[A] 

^k(A,N) E n lv,(f7)- We want to point out once again that time to time we will 
consider s G 5 as a subset of I. For example, if s = ^,^^2 ' ' ^ S, we will use the 
notation a^N = ^kkm E<x(s) 115=1 Iv,^ ( 1^ ) • 



4.3 A large deviation principle 

For each s G S*, let us consider a probability A* on M. If X is distributed like A'', let 
us denote As{p) = logEef^ and Da, = {p : A^(p) < oo}. Note that G Pa,, but 
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we want that G T^a^- So from now on we will focus our attention on those A* for 
which is an interior point in "Pa^. As G the mean Xg = J xdX'^{dx) exists 
and is finite quantity for each s G S*. Now if ■ ■ ■ are i.i.d. random variables 

having distribution A"*, we will consider A^ to be the law of + ^| + ■ ■ ■ + ^n)- 

Now by Cramer's theorem (Theorem lO.3.51) the sequence {A^} satisfies large deviation 
principle with a good, convex rate function X, given by X,(x) = supjpx — As(p)}. Note 



that this is a convex, good, non-negative lower semicontinuous function. Moreover, by 
property of good rate function, X,(xs) = for every s G 5* so that the set Is{x) < a 
is non-empty for every a > 0. We also want to point out that the functions T, are 
increasing on [0:5,00) and decreasing on (— cxo,^^]. 

Once again by Cramer's theorem, the arithmetic averages of i.i.d. mean zero, 
±1 valued random variables satisfy LDP with rate function Xq where To{y) = 00 for 
\y\ > 1; Xo(±l) = log 2 and for -1 < y < 1, 

^o{y) = ytanh"^ y — log cosh(tanh~^ y) 

= iog(i + y) + V iog(i - y)- ^^'^'^^ 

Let us define the map from S^v — > M'^ x M" as follows: 

a = (a\ a^ ■ ■ ■ , a") ^ U{s, a{s)), s e S) , 1 < t < n 



where a* is the sum of the entries of a*. 

Thus for each u (sample point of the random variables ^, which is suppressed so 
far), this map transports the uniform probability on Eat to M"^ x M". Denote this 
induced random probability by //at. Hence from (14.2.31) . we have. 



-logZ^(/3,/i) = log2--log / e V JdM^s,yi)- (4.3.2) 

Proposition 4.3.1. If for some A C I, J2N>n'^''^^'^^QAN<yAN < 00 then almost 
surely eventually ^n(0) = 0. 
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Proof. Let A be such that Eiv>n ^''^^'^^QAJvctAiv < oo. Then 

/^iv(n) = ^ E n i^s, am n Iv. (^) 

o- s&Sa ie[A] ^ ^ 

=ii^ E ni-.(«*.-w))ni'.(^) 



a':ie[A]seSA ie[A] 

As a consequence, 

p{n^{D) > 0) = p ( E n n (I) > 1 1 



<qan e n^^* 

(T»:ie[A] ie[A] 



The hypothesis and Borel-CanteUi lemma completes the proof. □ 
Let us note that, Efj,isf{n) — Qinckin- 

Proposition 4.3.2. If for all non-empty A C I, J2N>n'^~^^^'^^QAN'^AN < °° ^^^'^ 
for all e> a, almost surely eventually, 

(1 - e)QiNaiN < Aiiv(n) < (1 + (^)QiNOiiN- 



That is 



(1 - €)EnN{n) < /XAr(n) < (1 + €)EnN{n). 
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Proof. Note that 



a T \seS / i<n ^ ^ ^ ^ 

_ n2 ^2 

<^ E E E ^ (n ids, <t{s))) r(.)))] X 

Aci a Ti=cr,,V/:e[A] \se5 / 

n^.g)i„5) 



(since Qjj^ctjj^ cancels the terms corresponding to cxj 7^ r,, Vi G [/]) 



-i^Ef^E E n g) n 1. g) 1. (i) 

AC/ ^^'^ a Ti=o-i,Vje[A] je[A] ^ ^ ie[Ac] ^ ^ ^ ^ 



(by definition of Qan, A C I) 

Now by Chebyshev's inequahty for any e > 



p(|//jv(n) - EfiN{a)\ > eEfiNia)) < ^ E 



^2 ^^2''(^^^)QANaAN 

Once again Borel-CanteUi lemma and the hypothesis yield that a.s. eventually, 



(1 - e)EfXN{n) < //jv(n) < (1 + e)EfXN{n). 



Hence the proof. 



□ 
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Theorem 4.3.3. For a.e. uj, the sequence {/iAr(a;),A^ > n} satisfies LDP with rate 
function J given as follows: 



and 



[ teSA ie[A] ^ fce[A] J 

ses i6[/l 



= oo otherwise 

Proof. In what follows, A denotes a non empty subset of /. 

First of all note that, as Xq and Xg for s & S are convex, good rate functions, Vj 
is a convex compact set. 

Now let □ = rises ^ YYi=i open box in M.^ x M" where for each 

s E S and Vj for z < n are subintervals of M with rational end points. 
Step 1 Suppose that closure of □ is disjoint with Vj, that is Pj- fl □ = 0. In other 
words, for every {xs,yi) G □, there exists an A C /(depending on {xs,yi)) so that 
E 1t{xt) + E Pi-^o (~) > E Pfclog2. We shall show, almost surely eventually 
/Xiv(n) = 0. 

Note that as Xq and Ig are lower semicontinuous functions for every s G 5 and "Vj 
and are compact sets, we can get (x^, yj) G □ so that X(y°) = Xo(Vj) for 1 < -j < 
and X(x°) = Xs(As) for every s E S. 

For this point (x^, t/j) G □ there exists an A C / so that E ^t(^t) + 

E Pi^o ( ~ ) ^ E Pfc log 2. We will prove that for this A the hypothesis of Proposi- 

i&[A] Y ^-'^lA] 

tion l4.3.T] is satisfied and hence for this □ almost surely eventually /iAr(n) = leading 



to lim ^ logUAr(n) = — oo. 

Since {A^Iat satisfies LDP with rate function X,, we have 



limsup^logA^(A,) < -X,(AJ. 
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Let e > 0, to be chosen later. For all large N 
1 



N 



logA^(A,) < -J,(AJ + e = -J.(x°) +e, 



that is qsN = ^hi^s) < e ^(^"(^'^^ ^) eventually. And this is true for every s G Sa- So 
eventually 

Qan < e ^^'^ 

Similarly, the law of ^ satisfies LDP with rate pjXo(|^) and hence we will have 
eventually 

a AN < e '^f-*' 

Thus 



-TV E {pJo(^)-€) 



-N 

^'^^'''^QANaAN < e 



Now as — >. p. and we have strict inequality in ^ ^ti^t) + ^ Pilo (—j > 

t&SA ie[A] 

Pi log 2, we can choose an e so that 



J2 ^^^^'''^QANaAN < oo. 



N>n 



Hence by Proposition 14.3.1] we have, almost surely /iAr(n) = 0. It is not difficult to 
see now that almost surely /xat is eventually supported on a compact set. 
Step 2 Let us now consider a □ which has non-empty intersection with Vj. We 
show that for this □, almost surely. 
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.seS l<i<n 



< liminf — log/iAr(n) 

N^oo iV 



< limsup — log/iAr(n) 



(4.3.3) 



< 



,seS l<i<n 



Using LDP, we have 



liminf-logA^(A,) > -J,(A,). 



Hence for e > eventually, 



-logA^(A,)>-J,(A,)-6 



(4.3.4) 



for every s E S. Moreover, eventually. 



^,ogp(|,V.)>-p.I„(^l-. 



(4.3.5) 



for every i < n. Hence for every AC/, 



ie[Ai 



-N 



< e 



E ^log2- E (Xs(A,)+e)- E (pM^)+') 



ie[A] 



i6[Al 



by fHXil) and (031) . 

As P J is a convex set and □ is an non-empty open set, there exists at least one 
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Xg, Hj) in Vj n □, where 



l{x„y,): VAC J,^J,(x,) + $^p.Jo(-) < 5^p,log2 



Being a point in Vj, for every A C /, we have 

5^ J,(x°) + 5^p,Jo < 5^p.log2. 

That is 

s(^Sa i<n VP* / jgy^ 

The above being a strict inequahty, we can choose e depending on □ so that for every 
A C I the quantity 

is summable over A^. Now Proposition I4.3.21 yields (14.3.31) . This completes step 2. 

Towards step 3, let A be the collection of all open boxes □ with rational corner 
points satisfying either □nT'j = 0ornn Vj ^ 0. This collection is so rich that 
they form a base for the topology of M.^ x M^. Note that A being a countable family, 
out side a grand null set, for every □ in ^ conclusions of Step 1 and Step 2 hold. In 
the next two steps, we show 

J{xs,yi)= sup {-lim inf ^log/iAr(n)} (4.3.6) 

n:{xs,yi)en ^ 

= sup {-limsup ^log/iAr(n)}. (4.3.7) 

□:(xs,y/)en N iV 

Step 3 Let {xs,yi) ^ Vj. Then Vj being a closed set we can find a □ G ^ 
containing {xs,yi) so that □ does not intersect with Vj. By Step 1, /iAr(n) = 
eventually so that lim \og^j\f(D) = oo. Also by definition of J', we have J{xs^ yi) = 
oo. Hence the above equalities hold when {xs,yi) ^T^j- 
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Step 4 Now let (x^,?/?) G Vj and ^(^,^0) = {D e A : (x^y^j) E □}. Then, as 
observed in Step 2, for every □ G ^(^.o ,^0) we have eventually 



5^ J,(A,) + J2 pM-"^ 

,seS l<i<n 



< liminf — log/iAr(n) 

Af— >oo iV 

< limsup ^ log/iAr(n) 



< 



1_ 



l<i<n 



Pi 



(4.3.8) 



From the first part of the above inequality we have, 



iim^inf-iog/i^(n) > -j{n). 



And hence 



sup {- liminf- log /i;v(n)} < sup J(n) < (4.3.9) 

aeA,^o .,o^ ^ Dg^.^o „o, 



On the other hand, for every □ G using the right side inequality of fl4.3.8p . 

we have 

limsup^ log /i7v(n) < -J^(O). 

N 

Let A', (K = G ^ : A; > 1} be a subclass of A so that Dfc+i C Dfc for every 
k and flfcOfc = {(a^s, ?//)}• Then 



sup {-limsup ^ log /i AT (□)} > sup J{0) 

N 



> sup j^(n) 

= limj^(nfc) 



(4.3.10) 
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The last equality follows as JT" is a good lower semicontinuous function (see Proposition 

Em- 

Thus Step 3 and Step 4 complete the proof of fl4.3.6l) and fl4.3.7p . 
Now proof of Theorem 14.3.31 is completed by appealing to Proposition 10.3.31 and 
observing that {/iAr} is eventually supported on a compact set. □ 

Remark 4.3.1. A closer look at the above proof shows that the convexity of the rate 
functions Is is not an essential condition. Good rate functions on real line whose 
graph look like 'U' will suffice. That is we could take those non-negative functions 
X for which we will get at most two points x < a; so that X is zero on [x, x] ; strictly 
decreasing on (— oo,x]n{X G (0, oo)} and strictly increasing on [x, oo)n{X G (0, oo)}. 
With these conditions we can find (x^, y^) as stated in Step 1 and Step 2 for the proof 
to go through keeping all other Steps as it is. 

For each s, we started with a distribution A'^ and appealed to Cramer's theorem 
(see the first paragraph of this section). Instead, we could start with (A^, N > n) and 
assume that {A^} satisfies LDP with rate function X, having the properties mentioned 

in the above paragraph. 

Since almost every sequence of probabilities /iAr is eventually supported on a com- 
pact set, we could use Varadhan's lemma with any continuous function. This will lead 
to the following (see equation fl4.3.2p ): 

Theorem 4.3.4. In the word GREM, almost surely 




Though we have taken any real valued continuous function / on M , it is cus- 
tomary to consider f{xs) = (^s^s so that the Hamiltonian becomes II]\f{a) = 

s&S 

N as^{s,a{s)) + ha where a^, s G S* are non-negative weights and h > is the 

strength of the external field. It is also customary to consider Gaussian driving dis- 
tribution. In this setup if A* is standard normal, the above theorem will be applicable 
and will reduce to the following 
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Corollary 4.3.5. In the Gaussian word GREM with external field, almost surely, the 
limiting free energy is 

log 2 - inf I E ik^'s + PasXs) + t log 2^ + 2^ log £^ + | , 

where T>j is the set consisting of (x^,,?/^) G M"^ x M" such that \fA C I, 

E H + E (2^1og2^ + 2i^log£^) < E Pfclog2. 

sGSa ie[A] ^ ^ fee [A] 

Hence the use of large deviation techniques not only ensures the almost sure exis- 
tence of the limiting free energy, the calculation of free energy of the system is then 
reduced to that of an optimization problem. Of course, it is not always possible to 
solve this optimization problem to arrive at a closed form expression when the exter- 
nal field is present or different driving distributions are considered for different s G 5*. 
Even for n = 2 with Gaussian driving distribution it is difficult to obtain a closed 
form expression. The only case where we will get some 'closed' form expression is the 
case for n = 1 - that is the case of REM with external fields. This we will consider 
in the next section. But with no external field the situation is not that worse. In 
some of the cases, the method of calculation of the infimum will just reduce to what 
we did in section 13. 5[ In that case, though the model, to start with, was not an n 
level tree GREM, it reduced (as far as the free energy is concerned) to an n level tree 
GREM with appropriate weights [see §3.5] . It is quite conceivable that the present 
complicated model may always be equivalent to a tree GREM. We do not think so. 



4.4 REM with external field 

As mentioned in the last section, there is no general technique of obtaining a formula 
for the word GREM as well as tree GREM energy with external field. We will discuss 
here the simple REM with external field. Let us consider the word GREM where 
S consists of only one word, that is, S consists of the word <Ji<j2 where / = 

{^1, <j2, ■ ■ ■ , <^n} (see beginning of the previous section). In such a case the word GREM 
reduces exactly to the usual REM with external field. Thus the Hamiltonian is 

N 

HN{(T) = aNi, + h^a^ 
1=1 

where are i.i.d. random variables (for each fixed A^) and a are positive constants. 
Moreover for the Gaussian REM, are A^(0, -^). Then by Corollary 14. 3. 51 the limiting 
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free energy for the REM with external field exists almost surely and is given by 
£{P, h) = log 2 - inf log(l + + V log(l -y)+^{ax + hy) ] 

T>j K ) 

= log2 - inf |4 + Jo(y) + (5{ax + m| , 

where Xq is given by (14.3. ip and 

'^J = {(x, : f + log(l + + V log(l -y)< log 2} 
= {(x,i/):^+Jo(2/)<log2}. 

In other words, 

£(/3,/i)=log2-inf/(x,i/), 

where /(x, y) = | ^ + ^0(2/) — /5(aa; + | and V'j equals all points oiVj with both 
coordinates non-negative. 

To calculate the above infimum, first fix /?, /i and y with < ?/ < 1. Then the 
range of a; is < x < a/2 [log 2 — Xo(y)]- K is easy to see that if T^iy) < log 2 — |/3^a^ 
then the inf/(x, is attained for x = [3a and if Xo(?/) > log 2 — ^/5^a^ then the 

X 

infimum is attained for x = ■\/2[log2 — Xq (?/)]. Since Xq is a non-negative function, 
the set {1o{y) < log 2 — |/3^a^} will be non-empty only when j3 < ^a/2 log 2. For 
/3 > ^a/2 log 2, we always have Xo(|/) > log 2 — |/3^a^ so that the infimum is attained 
for X = a/2 [log 2 — Xo(?/)]. Substituting these values of x in f{x,y) we obtain the 
following expression for the infimum of f{x,y) over x. First we need a notation. For 
/? < log 2, let C/3 be the solution of 

XoM=log2-l/3V. (4.4.1) 

Then 



'gi{y) if^<^v/2bp andy<c^, 

^iy)= 4llf /(a;,y)= <!^72(2/) if/5<iv/2bp andy>c^, (4.4.2) 

92{y) if/5> iv/2bi2. 



0<x<A/2[log2-Xo(|/)] 



where 



gi{y) = ~f3^a^+My)-f3hy 



and 

^2(1/) = log 2 - /?av/2[log2-Xo(y)] - phy. 
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Since 



we have gi{0) = —[3h and 



g[{y) = tanh ^{y) - (3h, 



g'liy) =0^y = tanh(/?/i). 

On the other hand, as 

/3atanh-^(y) 
V2[log2 - Jo(y)] 

we have ^^(0) = -(3h and by fl4XTD . g'^ic/s) = -(3h + tanh-\c,3)- Thus g'^ic/s) < iff 
C/3 < tanh(/3/i). Moreover, 



/ / N < a tanh ""^(-w) < , 

g'M I ^ ^ r ! I ^• 



v/2[log2-Jo(y)] 
Let ?/o be the non-negative solution of 

atanh~^{y) 



v/2[log2-Jo(i/)] 



/i. (4.4.3) 



Such a solution always exists since log 2 — To(?/) — > as ?/ — > 1. 

Since Xq is a strictly increasing function of [0, 1], from equations (14.4.11) and (I4.4.3p . 
we note that 

tanh(/3/i) =yQ^yQ = cp. 

Now if /3 < i-\/2 log 2 and ?/o < C/? then tanh(/3/i) < yo < cp and the function 
in (14.4.21) is decreasing up to ?/ = tanh(/3/i) and then increasing. In such case, the 
inf ^{y) will occur aX y = tanh(/5/i) so that 

0<j/<l 

inf ip{y) = — ^/3^a^ — log cosh(/5/;.). 

0<2/<l 2 

On the other hand, if /3 < ^-\/2Tog2 and yo > cp then cp < y^ < tanh(/5/;,) and the 
function is decreasing up to ?/ = yo and then increasing. In such case, the inf ^{y) 

0<y<l 

will occur at y = 2/0 so that 



inf (/?(?/) = log 2 - /5aa;o - phyo, 

0<y<l 
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where xo = ^2[\og2 - Uy)] = ^il^ll^. 

Finally, if P > \\/2 log 2 then the function is decreasing up to ?/ = and then 
increasing. Hence in this case, the inf (^(y) will occur at y = y^ so that 

0<j/<l 

inf (p{y) = log 2 - paxo - phyo, 

0<2/<l 

where xo = V2[log 2 - Jo(y)] = ^il^ll^. 

We can summarize the above discussion in the following: 

Theorem 4.4.1. In the Gaussian REM with external field, the limiting free energy 
exists almost surely and given by 



S{P,h) 



log 2 + ^ + log cosh{(3h) ifl3< i V2bi2 and y^ < cp 
P {axo + hyo) otherwise, 



where yo be the non-negative solution of " ^^^^ = h, I^iy) = ytanh ^y — 

logcosh(tanh~^ y), cp is the solution oflQ{cp) = log 2 — |/3^a^ and Xq = — ^. 

Note that the case 'otherwise' in the theorem above consists oi [3 < \\J2 log 2 and 
> C/3 or if p > ^ V21og2. 

Theorem 14.4.11 provides yet another justification for the phase diagram (FIG. 3) 
in [161 of Derrida. 
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